p-hydroxyphenyl pyruvic acid dioxygenase, and a genetic construct containing a DNA sequence coding for the expression of Arabidopsis
CLONED PLANT P-HYDROXYPHENYL PYRUVIC ACID DIOXYGENASE

This invention was made with Government support 
under Grant Number 93373069083 awarded by the U.S. 
Department of Agriculture. The Government has certain 
rights in the invention. 

Field Of The Invention

The present invention relates to a molecular
approach for modifying the synthesis of vitamin E,
plastoquinone, and carotenoids in plants by use of a
full-length cloned cDNA which encodes a
p-hydroxyphenyl pyruvic acid dioxygenase enzyme.

Background Of The Invention 
The chloroplasts of higher plants contain many
unique, interconnected biochemical pathways that
produce an array of secondary metabolite compounds
which not only perform vital functions within the
plant but are also important from agricultural and
nutritional perspectives. Three such secondary
metabolites are the lipid soluble, chloroplastically
synthesized compounds vitamin E (α-tocopherol or
α-toc), plastoquinones (PQ), and carotenoids, which
together perform many crucial biochemical functions in
the chloroplast. PQ and vitamin E are quinone
compounds synthesized by a common pathway in the
plastid; carotenoids are tetraterpenoids synthesized
by a separate plastid-localized pathway.

Plastoquinone (PQ) often accounts for up to 50%
of the total plastidic quinone pool in green tissues.
The primary function of PQ is as a fundamental
component of the photosynthetic electron transport
chain, acting as an electron carrier between
photosystem II and the cytochrome b6f complex. PQ
likely has other less well studied functions in
plastids, namely in acting as a direct or intermediate
electron carrier for a variety of other biosynthetic
reactions in the chloroplast.

Vitamin E is the second major class of
chloroplastic quinones, accounting for up to 40% of
the quinone pool in plastids. The essential
nutritional value of tocopherols was recognized around
1925, and the compound responsible for Vitamin E
activity was first identified as α-tocopherol in 1936.
α-Toc has a well-documented role in mammals as an
antioxidant, and a similar, though less well
understood, antioxidant role in plants. Liebler, et
al., Toxicology 23:147-169, 1993; Hess, Anti-oxidants
in Higher Plants, CRC Press: 111-134, 1993.

Carotenoids are a separate, diverse group of
lipophilic pigments synthesized in plants, fungi, and
bacteria. In photosynthetic tissues, carotenoids
function as accessory pigments in light harvesting and
play important roles in photo-protection by quenching
free radicals, singlet oxygen, and other reactive
species. Siefermann-Harms, Physiol. Plantarum 69:561-
568, 1987. In the plastids of non-photosynthetic
tissues, high levels of carotenoids often accumulate,
providing the intense orange, yellow, and red
coloration of many fruits, vegetables, and flowers
(Pfander, Methods in Enzym. 213A, 3-13, 1992). In
addition to their many functions in plants,
carotenoids and their metabolites also have important
functions in animals, where they serve as the major
source of Vitamin A (retinol), and have been
identified as providing protection from some forms of
cancer due to their antioxidant activities. Vitamin
E's antioxidant activities are also thought to protect
against some forms of cancer, and may act
synergistically with carotenoids in this regard.
Liebler, et al., Toxicology 23:147-169, 1993; Krinsky,
J. Nutr. 119:123-126, 1989.

Tocopherol and Plastoquinone Synthesis 

α-Tocopherol and plastoquinone are the most
abundant quinones in the plastid and are synthesized
by the common pathway shown in Figure 1. The
precursor molecule for both compounds, homogentisic
acid (HGA), is produced in the chloroplast from the
shikimic acid pathway intermediate p-hydroxyphenyl
pyruvic acid (pOHPP), in an oxidation/decarboxylation
reaction catalyzed by the enzyme p-hydroxyphenyl
pyruvic acid dioxygenase (pOHPP dioxygenase).
Homogentisic acid is subject to
phytylation/prenylation (phytylpyrophosphate and
solanylpyrophosphate, C20 and C45, respectively) coupled
to a simultaneous decarboxylation by a phytyl/prenyl
transferase to form the first true tocopherol and
plastoquinone intermediates,
2-demethylphytylplastoquinol and
2-demethylplastoquinol-9, respectively. A single ring
methylation occurs on 2-demethylplastoquinol to yield
plastoquinol-9 (PQH2), which is then oxidized to
plastoquinone-9 (PQ). This oxidation is reversible
and is the basis of electron transport by
plastoquinone in the chloroplast.

The preferred route, as established in spinach,
for α-tocopherol formation from
2-demethylphytylplastoquinol appears to be 1) ring
methylation of the intermediate,
2-α-demethylphytylplastoquinol, to yield
phytylplastoquinol, 2) cyclization to yield
d-tocopherol and, finally, 3) a second ring
methylation to yield α-tocopherol. Ring methylation
in both tocopherol and plastoquinone synthesis is
carried out by a single enzyme that is specific for
the site of methylation on the ring, but has
relatively broad substrate specificity and
accommodates both classes of quinone compounds. This
methylation enzyme is the only enzyme of the pathway
that has been purified from plants to date.
d'Harlingue, et al., J. Biol. Chem. 26:15200, 1985. All
enzymatic activities of the α-toc/PQ pathway have been
localized to the inner chloroplast envelope by cell
fractionation studies except for pOHPP dioxygenase and
the tocopherol cyclase enzyme. Difficulties with cell
fractionation methods, low activities for some of the
enzymes, substrate stability and availability, and
assay problems make studying the pathway
biochemically difficult.

Vitamin E and PQ levels, ratios, and total
amounts vary by orders of magnitude in different
plants, tissues and developmental stages. Such
variations indicate that the vitamin E and PQ pathway
is both highly regulated and has the potential for
manipulation to modify the absolute levels and ratios
of the two end products. The pathway in Figure 1
makes it clear that production of homogentisic acid by
pOHPP dioxygenase is likely to be a key regulatory
point for bulk flow through the pathway, both because
HGA production is the first committed step in α-toc/PQ
synthesis, and also because the reaction is
essentially irreversible. Therefore modifying the
levels of HGA by modifying pOHPP dioxygenase activity
should have a direct impact on the total α-toc/PQ
biosynthetic accumulation in plant tissues, and, as
described below, because of the connection of PQ and
carotenoid synthesis, should also affect carotenoid
synthesis in plant tissues.

Carotenoid Biosynthesis; Quinones as Electron Carriers 

In plants, carotenoids are synthesized and
accumulate exclusively in plastids via the pathway
shown on the left-hand side of Figure 1. The first
committed step in carotenoid synthesis is the
condensation of two molecules of the C20 hydrocarbon
geranylgeranyl pyrophosphate (GGDP) by the enzyme
phytoene synthase, to form the colorless C40
hydrocarbon, phytoene. In oxygenic photosynthetic
organisms (e.g. plants, algae, and cyanobacteria),
phytoene undergoes two sequential desaturation
reactions, catalyzed by phytoene desaturase, to
produce ζ-carotene through the intermediate
phytofluene. Subsequently, ζ-carotene undergoes two
further desaturations, catalyzed by ζ-carotene
desaturase, to yield the red pigment lycopene.
Lycopene is cyclized to produce either α-carotene or
β-carotene, both of which are subject to various
hydroxylation and epoxidation reactions to yield the
carotenoids and xanthophylls most abundant in
photosynthetic tissues of plants: lutein, β-carotene,
violaxanthin and neoxanthin.

The genes encoding the first two enzymes of the
carotenoid pathway (phytoene synthase and phytoene
desaturase) have been isolated and studied from a
number of plant and bacterial sources in recent years.
Sandmann, Eur. J. Biochem. 223:7-24, 1994. Phytoene
desaturase has been the most intensively studied, both
because it is a target for numerous commercially
important herbicides, and also because the phytoene
desaturation reaction is thought to be a rate limiting
step in carotenoid synthesis. Molecular and
biochemical studies suggest that two types of phytoene
desaturase enzymes have evolved independently:
the crtI-type found in anoxygenic
photosynthetic organisms (e.g. Rhodobacter and
Erwinia), and the pds-type found in oxygenic
photosynthetic organisms. Despite their differences
in primary amino acid sequence, all phytoene
desaturase enzymes contain a dinucleotide binding
domain (FAD or NAD/NADP), which in Capsicum annuum has
been shown to be FAD. Hugueney et al., Eur. J.
Biochem. 209:399-407, 1992. Presumably, the bound
dinucleotide in both types of phytoene desaturase
enzymes is reduced during desaturation and reoxidized
by an unknown oxidant (electron acceptor) present in
the plastid or bacterium.

Several lines of evidence have suggested a role
for quinones in the phytoene desaturation reaction in
higher plants. Using isolated daffodil chromoplasts,
Mayer and co-workers demonstrated that in an anaerobic
environment, oxidized artificial quinones were
required for the desaturation of phytoene while
reduced quinones were ineffective. Mayer et al., Eur.
J. Biochem. 191:359-363, 1990. Further supporting
evidence comes from studies with the triketone class
of herbicides (e.g. Sulcotrione), which cause phytoene
accumulation in treated tissues but, unlike the well-
studied pyridazone class (e.g. Norflurazon (NFZ)), do
not directly affect the phytoene desaturase enzyme.
Rather, triketone herbicides competitively inhibit
pOHPP dioxygenase, an enzyme common to the synthesis
of both plastoquinone and tocopherols, suggesting that
one or more classes of quinones may play a role in
carotenoid desaturation reactions. Schulz et al.,
FEBS 318:162-166, 1993; Secor, Plant Physiol.
106:1429-1433; Beyer et al., IUPAC Pure and Applied
Chemistry 66:1047-1056, 1994.

Despite the well-studied, wide-spread importance
of vitamin E, plastoquinone, and carotenoids to human
nutrition, agriculture, and biochemical processes
within plant cells, much remains unclear about their
biosynthesis and accumulation in plant tissues. This
uncertainty has in turn limited the potential for
manipulation of the synthesis and levels of these
important compounds in plants.

Summary of the Invention
In one embodiment, this invention provides a
biologically pure sample of DNA which DNA comprises a
sequence coding for the expression of Arabidopsis
thaliana p-hydroxyphenyl pyruvic acid dioxygenase.

In other embodiments, this invention provides a
vector and microbial host containing a DNA sequence
sufficiently homologous to SEQ ID NO:1 so as to code
for the expression of Arabidopsis thaliana
p-hydroxyphenyl pyruvic acid dioxygenase, and a genetic
construct containing a DNA sequence sufficiently
homologous to SEQ ID NO:1 so as to code for the
expression of Arabidopsis thaliana p-hydroxyphenyl
pyruvic acid dioxygenase, together with a promoter
located 5' to the DNA coding sequence and a 3'
termination sequence.

In another embodiment, this invention provides a
method of creating a transgenic plant in which the
levels of the pOHPP dioxygenase enzyme are elevated
sufficiently that production of plastoquinones,
vitamin E, and carotenoids is modified.

It is an object of the present invention to
genetically engineer higher plants to modify the
production of plastoquinones, vitamin E, and
carotenoids.

It is another object of the invention to provide
transgenic plants that express elevated levels
of the pOHPP dioxygenase enzyme and therefore have
elevated resistance to the triketone class
of herbicides (e.g. sulcotrione).

It is another object of the present invention to
provide a method for the preparation of the enzyme
p-hydroxyphenyl pyruvic acid dioxygenase (pOHPP
dioxygenase), an enzyme which can be used to identify
new pOHPP dioxygenase-inhibiting herbicides.

Other features and advantages of the invention 
will be apparent from the following description of the 

Brief Description of the Drawings
Fig. 1 is a diagram of the pathways for synthesis
of carotenoids, vitamin E (tocopherol), and
plastoquinone.

Fig. 2 is a diagram of the interconnections of
the pathways illustrated in Fig. 1.

Figs. 3A-3E are graphs of pigment analyses of
wild-type, NFZ-wt, and pds1 tissues.

Fig. 4 is a physical map of the pds1 mutation
relative to visible markers.

Figs. 5A-5C present the results of C18 HPLC
separation of lipid soluble pigments from wild-type
plants on MS2 media, homozygous pds1 mutants on MS2
media supplemented with pOHPP, and homozygous pds1
mutants on MS2 media supplemented with homogentisic
acid (HGA).

Figs. 6A-6B present the results of C8 HPLC
analyses of quinones in NFZ-wt and pds1 tissues.

Detailed Description Of The Invention
As described above, Vitamin E,
plastoquinones, and carotenoids are synthesized and
accumulated in plastids by the pathways shown in
Figure 1. This specification describes the
identification, isolation, characterization and
functional analysis of a higher plant pOHPP
dioxygenase cDNA, its role in α-toc, PQ and carotenoid
synthesis, and the use of this cDNA to modify pOHPP
dioxygenase activity in plant tissues and hence the
accumulation of one or more of the compounds
plastoquinones, vitamin E, and carotenoids in plant
tissues. The overexpression of pOHPP dioxygenase in
transgenic plants will modify the enzyme-to-inhibitor
ratio of plant tissues exposed to triketone
herbicides, as compared to non-transgenic plants,
resulting in increased herbicide resistance. The
present specification also describes a genetic
construct for use in the production of pOHPP
dioxygenase, an enzyme useful in identifying new pOHPP
dioxygenase-inhibiting herbicides.

By genetic analysis the present inventors have
shown that the vitamin E, plastoquinone, and
carotenoid biosynthetic pathways are interconnected
and share common elements as shown in Figure 2. From
mutational studies in Arabidopsis thaliana, the
present inventors identified one genetic locus,
designated pds1 (pds = phytoene desaturation), the
disruption of which results in accumulation of the
first carotenoid of the carotenoid biosynthetic
pathway, phytoene. Surprisingly, though this mutation
disrupts carotenoid synthesis and was originally
identified on this basis, it does not map to the locus
encoding the phytoene desaturase enzyme. Evidence
indicates that pds1 defines a second gene product, in
addition to the phytoene desaturase enzyme, necessary
for phytoene desaturation and hence carotenoid
synthesis in higher plants. This gene product proved
to be pOHPP dioxygenase.

To provide a molecular mechanism for manipulating
synthesis and accumulation of the compounds
plastoquinone, vitamin E, and carotenoids, the present
inventors used a molecular genetic approach, taking
advantage of the model plant system Arabidopsis
thaliana to define, isolate and study genes required
for synthesis of the compounds in plants. The
flowering plant Arabidopsis thaliana has come into
wide use as a model system to explore the molecular
biology and genetics of plants. Arabidopsis offers
many advantages for genetic analysis: it can be selfed
and very large numbers of progeny can be obtained (up
to 10,000 seeds from a single plant). Furthermore,
Arabidopsis has a short generation time of five to six
weeks, so crosses can be set up and the progeny
analyzed within reasonable periods of time. Mutation
screens have identified thousands of mutations
affecting many aspects of basic plant biology,
including morphogenesis, photosynthesis, fertility,
starch and lipid metabolism, mineral nutrition, and so
on. In addition, its haploid genome is only about 10^8
base pairs.

An important aspect of the successful approach
used here is that essential components were first
functionally defined genetically, prior to their
isolation, analysis and molecular manipulation.
Briefly, potential mutants were identified by a
combination of phenotypic and biochemical screening,
characterized at the genetic and molecular levels,
loci of interest selected, and the corresponding genes
then cloned and studied further. By this approach,
the inventors genetically defined and isolated cDNAs
for one gene, pds1, whose mutation disrupts synthesis
of all three classes of compounds in the plastid:
tocopherols, plastoquinones and carotenoids. Based on
biochemical analysis of the pds1 mutant, the pds1 gene
was identified as affecting the activity of pOHPP
dioxygenase, a crucial enzyme of the plastidic quinone
pathway in plants (Figure 1) that is directly
required for the synthesis of plastoquinone and
α-tocopherol and indirectly for carotenoid synthesis.
In particular, the deduced functions of the pds1 mutant
and pOHPP dioxygenase enzyme are noted in Figure 2.
The present inventors demonstrated by biochemical
complementation that the pds1 mutation affects the
enzyme p-hydroxyphenyl pyruvic acid dioxygenase (pOHPP
dioxygenase), because pds1 plants can be rescued by
growth on the product but not the substrate of this
enzyme, homogentisic acid (HGA) and
p-hydroxyphenylpyruvate (pOHPP), respectively. pOHPP
dioxygenase is the key branch point enzyme and
committed step in the synthesis of both Vitamin E and
plastoquinones, and several independent lines of
biochemical evidence confirm that pds1 affects this enzyme
(Figures 1, 5, 6). These results provide the first
genetic evidence that plastoquinones are essential
components for carotenoid synthesis in higher plants,
most likely as an electron carrier/redox element in
the desaturation reaction (Figure 2). The Arabidopsis
pOHPP dioxygenase gene/cDNA thus provides a basis for
modifying the production of plastoquinones,
α-tocopherol and carotenoids in all higher plants.

Specifically, the specification describes the
genetic identification of the Arabidopsis pOHPP
dioxygenase gene by mutational analysis, the physical
isolation and functional confirmation of an
Arabidopsis pOHPP dioxygenase cDNA, its nucleotide
sequence and its use to isolate pOHPP dioxygenase
genes and cDNAs from other plant species. Also
included in the specification is a description of the
use of the Arabidopsis pOHPP dioxygenase cDNA, and
related cDNAs from other plants, to positively or
negatively modify the expression/activity of pOHPP
dioxygenase by recombinant techniques (overexpression,
cosuppression, antisense, etc.) in any and all plant
tissues, especially leaf and fruit tissues, to
positively or negatively affect the production of
α-toc, PQ and carotenoids.

Elevating pOHPP dioxygenase protein levels
increases the amount of homogentisic acid (HGA)
synthesized in plant tissues. Because HGA is the
limiting precursor molecule for α-toc and PQ synthesis
(the end products of the pathway), increasing HGA
synthesis increases the levels of α-toc (Vitamin E)
and PQ in plant tissues. The increase in PQ
indirectly increases the synthesis of carotenoids,
which require PQ for their synthesis. In addition,
the increase in PQ increases photosynthetic efficiency
by increasing electron flow between photosystem II and
photosystem I, because PQ is the primary electron
transporter between the two photosystems. The
increase in α-toc, a well-studied antioxidant in
mammals, increases the ability of plants to withstand
oxidative stresses, such as that caused by high light,
high temperature, water stress, ozone stress, UV
stress or other abiotic or biotic stresses. Elevating
the levels of pOHPP dioxygenase will modify the dose
response curve of herbicides targeting pOHPP
dioxygenase, thus increasing the relative resistance
to such herbicides in transgenic plants as compared to
native plants of the same species. Inhibiting the
expression of pOHPP dioxygenase is expected to have
the opposite effect.
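
The effect of enzyme level on the herbicide dose response can be illustrated with a textbook steady-state model. The sketch below assumes simple Michaelis-Menten kinetics with a purely competitive inhibitor; the function names and every parameter value (arbitrary units) are hypothetical and are not measurements from this specification.

```python
def relative_rate(substrate, inhibitor, enzyme_level=1.0, km=1.0, ki=1.0, kcat=1.0):
    """Michaelis-Menten rate in the presence of a competitive inhibitor.

    All quantities are in arbitrary units; Vmax is assumed to scale
    linearly with the amount of enzyme (an assumption of this sketch).
    """
    vmax = kcat * enzyme_level
    return vmax * substrate / (km * (1.0 + inhibitor / ki) + substrate)


def inhibitor_for_target_rate(target, substrate, enzyme_level, km=1.0, ki=1.0, kcat=1.0):
    """Inhibitor concentration at which the rate falls to `target`.

    Obtained by solving the rate equation above for the inhibitor term.
    Returns 0.0 when the uninhibited rate is already at or below target.
    """
    vmax = kcat * enzyme_level
    needed = ki * ((vmax * substrate / target - substrate) / km - 1.0)
    return max(0.0, needed)


if __name__ == "__main__":
    # Hypothetical comparison: native enzyme level vs. two-fold overexpression.
    for level in (1.0, 2.0):
        dose = inhibitor_for_target_rate(0.25, substrate=1.0, enzyme_level=level)
        print(f"enzyme level {level}: rate falls to 0.25 at inhibitor = {dose}")
```

Under these assumed parameters, doubling the enzyme level raises the inhibitor concentration needed to reach the same residual rate, which is the qualitative shift in the dose response curve described above.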



Genetic Construct

To express pOHPP dioxygenase in a plant, it is
required that a DNA sequence containing the pOHPP
dioxygenase coding sequence be combined with
regulatory sequences capable of expressing the coding
sequence in a plant. A number of effective plant
promoters, both constitutive and developmentally or
tissue specific, are known to those of skill in the
art. A transcriptional termination sequence
(polyadenylation sequence) may also be added. Plant
expression vectors, or plasmids constructed for
expression of inserted coding sequences in plants, are
widely used in the art to assemble chimeric plant
expression constructs including the coding sequence,
and to conveniently transfer the constructs into
plants. A sequence which codes for pOHPP dioxygenase
includes, for example, SEQ ID NO:1, or versions of the
designated sequence sufficient to effect coding for
the expression of pOHPP dioxygenase. Commonly used
methods of molecular biology well-known to those of
skill in the art may be used to manipulate the DNA
sequences.

By "genetic construct" we mean any of a variety
of ways of combining the protein-encoding sequences
with a promoter sequence (and termination sequence, if
necessary) in a manner that operably connects the
promoter sequence (and termination sequence, if
present) with the protein-encoding sequences.
Typically, the promoter sequence will be "upstream" of
a protein-encoding sequence, while the termination
sequence, if used, will be "downstream" of the
protein-encoding sequences.

The protein-encoding, promoter and termination
sequences may be combined on a plasmid or viral
vector, and inserted into a microbial host. Other
functional sequences may be added to the gene
construct. Alternatively, the protein-encoding,
promoter, and termination sequence, if added, may be
combined with any other needed functional sequences
and used without a vector.

The DNA sequence described by SEQ ID NO:1 is
sufficient to effect coding for the expression of
pOHPP dioxygenase. However, it is envisioned that the
above sequence could be truncated and still confer the
same properties. It is not known at present which
specific deletions would be successful, but it is
likely that some deletions to the protein would still
result in effective enzymatic activity. One skilled
in the art of molecular biology would be able to take
the designated sequence and perform deletional
analysis experiments to determine what portions of the
designated sequence are essential to effect coding for
the expression of pOHPP dioxygenase. One could create
a genetic construct with the candidate deletion
mutations and perform experiments as described below
in the Examples, to test whether such deletion
mutation sequences effect coding for the enzyme.
Expression of the enzyme activity indicates a
successful deletion mutant or mutants. In this
manner, one could determine which parts of the
designated sequence are essential for expression of the
enzyme.

It is also known that the genetic code is
degenerate, meaning that more than one codon, or set
of three nucleotides, codes for most amino acids. Thus
it is possible to alter the DNA coding sequence for a
protein, such as the sequence for pOHPP dioxygenase
described here, without altering the sequence of the
protein produced. Selection of codon usage may affect
expression level in a particular host. Such changes
in codon usage are also contemplated here.
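
Codon degeneracy can be illustrated with a short sketch. It builds the standard genetic code table and shows two different DNA sequences that translate to the same peptide. The example sequences are hypothetical and are not taken from SEQ ID NO:1; the function names are illustrative only.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G order
# (first base varies slowest, third base fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA coding sequence, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def synonymous_codons(codon):
    """Return every codon that encodes the same amino acid as `codon`."""
    aa = CODON_TABLE[codon]
    return sorted(c for c, a in CODON_TABLE.items() if a == aa)


if __name__ == "__main__":
    original = "ATGCTTGGAAAA"  # hypothetical coding fragment
    recoded = "ATGCTGGGCAAG"   # same peptide, different codon choices
    print(translate(original), translate(recoded))
    print(synonymous_codons("CTT"))
```

A recoding step of this kind changes only the nucleotide sequence; which synonymous codons give the best expression in a particular host is a separate, host-specific question that this sketch does not address.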

It is further contemplated that, using the
Arabidopsis pOHPP gene coding sequence described here,
the homologous pOHPP dioxygenase sequences from
other higher plants can be readily recovered.
Oligonucleotides can be made from the sequence set
forth below, either to hybridize against cDNA or
genomic libraries or to be used for PCR amplification of
homologous pOHPP dioxygenase sequences from other
plants.

Once a pOHPP gene is in hand, whether from
Arabidopsis or from some other plant species, it then
becomes possible to insert a chimeric plant expression
genetic construct into any plant species of interest.
Suitable plant transformation methods exist to insert
such genetic constructs into most, if not all,
commercially important plant species. Presently known
methods include Agrobacterium-mediated transformation,
coated-particle gene delivery (Biolistics) and
electroporation, in which an electric voltage is used
to facilitate gene insertion. All these methods, and
others, can insert the genetic construct into the
genome of the resulting transgenic plant in such a way
that the genetic construct becomes an inheritable
trait, transmitted to progeny of the original
transgenic plant by the normal rules of Mendelian
inheritance. Thus, once a genetic construct
expressing a pOHPP gene is inserted into a plant, it
can become a part of a plant breeding program for
transfer into any desired genetic background.

To over-express pOHPP dioxygenase, a genetic
construct may be used with a higher strength promoter.
To inhibit expression of endogenous pOHPP dioxygenase,
an antisense genetic construct can be made, as is
known by those of skill in the art, to reduce the
level of pOHPP dioxygenase present in the plant
tissues.
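
As general background, an antisense construct typically places the coding sequence in reverse orientation relative to the promoter, so that the transcript is complementary to the endogenous mRNA. The sketch below shows only the underlying sequence operation, the reverse complement; the input sequence is hypothetical and is not taken from SEQ ID NO:1.

```python
# Translation table mapping each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return dna.translate(COMPLEMENT)[::-1]


if __name__ == "__main__":
    sense = "ATGGCCAAGT"  # hypothetical sense-strand fragment
    antisense = reverse_complement(sense)
    print(sense, "->", antisense)
```

Applying the operation twice returns the original sequence, which is a convenient sanity check when orienting an insert.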

EXAMPLES 

Isolation of pds1, a mutant defective in carotenoid
synthesis

To further understand carotenoid biosynthesis and
its integration with other pathways in the chloroplast
in higher plants, the present inventors studied the
pathway by isolating Arabidopsis thaliana mutants
that are blocked in carotenoid synthesis.

Homozygous defects in the early stages
of carotenoid synthesis (e.g. prior to production of
β-carotene) are lethal to plants grown in soil, and the
isolation of such mutations requires the design of
screening procedures to identify plants heterozygous
for soil lethal mutations. The present inventors
found that most soil lethal, homozygous
pigment-deficient Arabidopsis mutants can be grown to
near maturity in tissue culture on Murashige and Skoog
basal media (Murashige and Skoog, Physiol. Plant.
15:473-497, 1962) supplemented with sucrose (MS2
media). Under these conditions, photosynthesis and
chloroplast development are essentially dispensable
and all the energy and nutritional needs of the plant
are supplied by the media.

Greater than 500 lines from the 10,000 member
Feldmann T-DNA tagged Arabidopsis thaliana population
(Forsthoefel et al., Aust. J. Plant Physiol. 19:353-
366, 1992) were selected for pigment analysis based on
their segregation for lethal pigment mutations. Seed
from plants heterozygous for lethal pigment mutations
were surface sterilized and grown on MS2 media, the
segregating pigment mutants identified, tissue
harvested from individual plants, and HPLC pigment
analysis performed. Although numerous mutant lines
with severe pigment deficiencies were identified, only
two were found to be carotenoid biosynthetic mutants.
One mutant line isolated from this group, pds1, is
described in detail here.

The hallmark phenotype for disruption of a
biosynthetic pathway is the accumulation of an
intermediate compound prior to the site of blockage.
Such blockage of the carotenoid pathway can be
mimicked chemically by treatment of wild-type plants
with the herbicide NFZ, an inhibitor of the phytoene
desaturase enzyme (Figure 1) which has been reported
to cause accumulation of phytoene in treated tissues.
Britton, Z. Naturforsch. 34c:979-985, 1979. Figs. 3A-
3E present the results of pigment analysis of wild-
type, NFZ-wt, and pds1 tissues. Abbreviations in
Figs. 3A-3E are as follows: N, neoxanthin; V,
violaxanthin; L, lutein; Cb, chlorophyll b; Ca,
chlorophyll a; β, β-carotene.

Figure 3A shows Reverse Phase HPLC analysis of
the carotenoids that accumulate in wild-type
Arabidopsis thaliana leaves. In comparison, Figure 3B
shows the pigment profile for NFZ treated wild-type
(NFZ-wt). Spectral analysis of the strongly absorbing
296nm peak at 33 minutes in NFZ-wt tissue shows
absorbance maxima at 276, 286, and 298nm, indicative
of phytoene (Figure 3D). Figure 3C shows pigment
analysis of tissue culture grown homozygous pds1
mutant plants. The low absorbance at 440nm in Figures
3B and 3C demonstrates that, like NFZ-wt, pds1 mutants
lack all chlorophylls and carotenoids that normally
accumulate in wild-type tissue (compare to Figure 3A).
However, unlike wild-type, pds1 mutants contain a peak
with a retention time at approximately 33 minutes that
absorbs strongly at 296nm. The retention time and
absorbance of the 33-minute peak in the pds1 mutant
correspond to the phytoene peak in pigment extracts
of NFZ-wt tissue (Figure 3B). Spectral analysis of
the 33-minute peak from pds1 is shown in Figure 3E and
is virtually identical to the spectra of phytoene from
NFZ-wt tissue (Figure 3D) as well as to the published
spectra for phytoene. These results confirm the
chemical identity of the accumulating compound in pds1
as phytoene and conclusively demonstrate that the pds1
mutation disrupts carotenoid biosynthesis.

Carotenoid Analysis

For quantitative and qualitative carotenoid
analysis, plant tissue is placed in a microfuge tube
and ground with a micropestle in 200 µl of 80% acetone.
120 µl of ethyl acetate is added and the mixture
vortexed. 140 µl of water is added and the mixture
centrifuged for 5 minutes. The carotenoid containing
upper phase is then transferred to a fresh tube and
vacuum dried in a Jouan RC1010 Centrifugal Evaporator.
The dried extract is resuspended in ethyl acetate at a
concentration of 0.5 mg fresh weight of tissue per µl
and either analyzed immediately by HPLC or stored at
-80°C under nitrogen.
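
The resuspension step is a fixed ratio of tissue fresh weight to solvent volume, so the volume of ethyl acetate follows directly from the weight of tissue extracted. A minimal sketch of that arithmetic is given below; the function name and the 20 mg sample are hypothetical.

```python
def resuspension_volume_ul(fresh_weight_mg, mg_per_ul=0.5):
    """Volume of ethyl acetate (µl) needed to resuspend a dried extract.

    The default of 0.5 mg fresh weight per µl is the concentration stated
    in the carotenoid analysis procedure above.
    """
    if fresh_weight_mg <= 0 or mg_per_ul <= 0:
        raise ValueError("fresh weight and concentration must be positive")
    return fresh_weight_mg / mg_per_ul


if __name__ == "__main__":
    # Hypothetical sample: 20 mg fresh weight of leaf tissue.
    print(resuspension_volume_ul(20.0), "µl of ethyl acetate")
```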

Carotenoids were separated by reverse-phase HPLC
analysis on a Spherisorb ODS2 5 micron C18 column, 25
cm in length (Phase Separations Limited, Norwalk, CT),
using a 45 minute gradient of ethyl acetate (0-100%)
in acetonitrile/water/triethylamine (9:1:0.01 v/v) at
a flow rate of 1 ml per minute (Goodwin and Britton,
1988). Carotenoids were identified by retention time
relative to known standards with detection at both
296 nm and 440 nm. When needed, absorption spectra for
individual peaks were obtained with a Hewlett Packard
1040A photodiode array detector and compared with
published spectra or available standards.
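
The solvent composition during the carotenoid separation can be sketched numerically. The following is only an illustration: it assumes that the 0-100% ethyl acetate gradient is linear over the 45 minutes, which the text does not state, and the function name is hypothetical.

```python
def ethyl_acetate_percent(t_min, gradient_min=45.0):
    """Percent ethyl acetate at time t_min, ASSUMING a linear 0-100% gradient."""
    if t_min <= 0:
        return 0.0
    if t_min >= gradient_min:
        return 100.0
    return 100.0 * t_min / gradient_min


# Composition at a few time points, including the ~33 minute phytoene peak.
for t in (0, 15, 33, 45):
    ea = ethyl_acetate_percent(t)
    print(f"{t:>2} min: {ea:5.1f}% ethyl acetate, "
          f"{100.0 - ea:5.1f}% acetonitrile/water/triethylamine")
```

Under this assumption, the phytoene peak at about 33 minutes would elute at roughly three quarters ethyl acetate.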



Quinone analysis 

Quinones were extracted from tissue using a
method modified from that described in Bligh et al.,
Can. J. Biochem. Physiol. 37:911-917, 1959. Frozen
plant tissue was ground in a mortar with 3 volumes of
chloroform and 6 volumes of methanol and transferred
to a test tube. Water and additional chloroform were
added until a biphasic mixture was obtained. The
quinone-containing chloroform phase was then
collected. To increase yields, the aqueous phase was
back-extracted with chloroform, and the two chloroform
phases were pooled and then filtered through Whatman #3
filter paper. The resulting filtrate was dried under
a constant stream of nitrogen. Once dried, the pellet
was resuspended in methanol at a concentration of 10 mg
fresh weight per ml and immediately analyzed by HPLC.
Quinones were resolved by reversed-phase HPLC analysis
on a LiChrosorb RP-8, 5 micron column, 25 cm in length
(Alltech, San Jose, CA), using an isocratic solvent of
10% H2O in methanol for the first 14 minutes, at which
time the solvent was switched to 100% methanol for the
remainder of the run (modified from the method
described in Lichtenthaler, Handbook of
Chromatography, CRC Press, 115-159, 1984). The flow
rate was 1 ml per minute for the duration. Peaks were
identified based upon the retention time of known
standards, with detection at 280 nm for α-tocopherol and
260 nm for plastoquinone and ubiquinone, as well as by
absorption spectra from a Hewlett Packard 1040A
photodiode array detector. When needed, fractions
represented by individual chromatographic peaks were
collected and submitted to the Southwest
Environmental Health Science Center, Analytical Core
laboratory for mass spectral analysis. Results were
obtained using a TSQ7000 tandem mass spectrometer
(Finnigan Corp., San Jose, CA) equipped with an
atmospheric pressure chemical ionization source
operated in the positive ion mode. The instrument was
set to unit resolution and the samples were introduced
into the source in a 0.3 ml/minute methanol stream and
ionized using a 5 kV discharge.
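
The two-step solvent program and the resuspension concentration used for quinone analysis can be written out as a short sketch. The function names are hypothetical and the fresh weight in the example is illustrative only.

```python
def water_percent(t_min, switch_min=14.0):
    """Percent water in methanol at time t_min: 10% until the switch, then 0%."""
    if t_min < 0:
        raise ValueError("time must be non-negative")
    return 10.0 if t_min < switch_min else 0.0


def resuspension_volume_ml(fresh_weight_mg, mg_per_ml=10.0):
    """Methanol volume giving the stated 10 mg fresh weight per ml."""
    return fresh_weight_mg / mg_per_ml


for t in (5.0, 13.5, 14.0, 27.0):
    print(f"{t:4.1f} min: {water_percent(t):4.1f}% H2O in methanol")

# Illustrative only: 250 mg of tissue would be resuspended in 25 ml.
print(resuspension_volume_ml(250.0), "ml")
```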

Genetic analysis of pds1

The genetic nature of the pds1 mutation was
determined by analyzing seeds resulting from selfing
pds1 heterozygous plants. Prior to desiccation, F1
seeds were scored as either green (wild-type or
heterozygous) or white (homozygous). A 3:1
segregation ratio was observed (146 green seeds : 48
white seeds), indicating that pds1 is inherited as
a single recessive nuclear mutation (χ² = 0.01, p > 0.90).
Because pds1 mutants are inhibited in the desaturation
of phytoene, the inventors believed that it might be a
mutation in the phytoene desaturase enzyme, which had
previously been mapped to chromosome 4, between ag and
bp. Wetzel et al., Plant J. 6:161-175, 1994. To test
this hypothesis, the pds1 mutation was mapped relative
to visible markers. The pds1 mutation was found to
map to chromosome 1, approximately 7 cM from dis1
toward clv2. Franzmann et al., Plant J. 7:341-350,
1995. These data are summarized in Figure 4
and establish that pds1 does not map to the phytoene
desaturase enzyme locus, thus proving that the pds1
mutation is not in the phytoene desaturase enzyme.
These data provided important insight for
characterization of the pds1 mutant.
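
The goodness-of-fit statistic quoted for the 146:48 seed count can be reproduced with a few lines of arithmetic. This is a sketch using only the standard library; the function names are hypothetical, and the p-value uses the closed form for one degree of freedom.

```python
import math


def chi_square_3_to_1(green, white):
    """Chi-square statistic for observed counts against a 3:1 expectation."""
    total = green + white
    expected_green = 0.75 * total
    expected_white = 0.25 * total
    return ((green - expected_green) ** 2 / expected_green
            + (white - expected_white) ** 2 / expected_white)


def p_value_one_df(chi2):
    """Upper-tail probability of a chi-square variate with 1 degree of freedom."""
    return math.erfc(math.sqrt(chi2 / 2.0))


chi2 = chi_square_3_to_1(146, 48)
print(f"chi-square = {chi2:.2f}, p = {p_value_one_df(chi2):.2f}")
```

With 194 seeds the expected counts are 145.5 green and 48.5 white, so the observed counts deviate by only half a seed in each class.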
Homozygous pds1 mutants can be rescued by Homogentisic
Acid, an intermediate in plastoquinone and tocopherol
biosynthesis

As described earlier, previous research
suggesting a role for quinones and pOHPP dioxygenase
in phytoene desaturation led the present inventors to
investigate the quinone biosynthetic pathway in the
pds1 mutant. The early stages of
plastoquinone/tocopherol synthesis were functionally
analyzed by growth in the presence of two intermediate
compounds in the pathway, p-hydroxyphenylpyruvate
(pOHPP) and homogentisic acid (HGA) (refer to Figures
1 and 2). Albino pds1 homozygous plants were first
germinated on MS2 media and then transferred to MS2
media supplemented with 100 mM of either pOHPP or HGA.
pds1 plants remained albino when transferred to media
containing pOHPP, but greening occurred when pds1
plants were transferred to media containing HGA.
Figs. 5A-5C present the results of complementation of
the pds1 mutation with homogentisic acid. Each
profile represents pigments extracted from 10 mg fresh
weight of tissue. Abbreviations used in Figs. 5A-5C
are as described in Figs. 3A-3E. HPLC analyses, with
detection at 440 nm, of the carotenoids extracted from
pds1 plants grown on pOHPP and HGA are shown in
Figures 5B and 5C, respectively. The pigment profiles
of pds1 mutants grown on pOHPP are similar to the
profiles of pds1 plants grown on MS2 media shown in
Figure 3B. Comparison of the pigment profiles for
pds1 + HGA tissue and wild-type tissue (Figures 5A and
5C) indicates that growth in the presence of HGA is
able to qualitatively restore a wild-type carotenoid
profile to albino, homozygous pds1 plants. These
results indicate that the pds1 mutation affects the
enzyme pOHPP dioxygenase, because pds1 mutants are not
altered by growth on the substrate of this enzyme,
pOHPP, but rather are restored qualitatively to
wild-type pigmentation by growth on the product of
this enzyme, HGA (refer to Figures 1 and 2). The
complementation of pds1 with HGA also indicates that
intermediates or end products of this pathway
(plastoquinone and/or tocopherols, refer to Figures 1
and 2) are necessary components for phytoene
desaturation in plants and confirms the observation of
Schulz et al. in FEBS, where inhibitors of pOHPP
dioxygenase were shown to cause accumulation of
phytoene.

HPLC analysis conclusively demonstrates that pds1 is a
mutation in the plastoquinone/tocopherol biosynthetic
pathway that also affects carotenoid synthesis

In addition to biochemical complementation of
pds1 mutants, the plastoquinone/tocopherol pathway was
also directly analyzed in pds1 tissue by utilizing C8
HPLC to resolve total lipid extracts and identify
three separate classes of quinones: ubiquinone,
plastoquinone, and α-tocopherol (Vitamin E) (Figures 5
and 6). Ubiquinone and plastoquinone perform
analogous electron transport functions in the
mitochondria and chloroplast, respectively, but are
synthesized by different pathways in separate
subcellular compartments (Goodwin et al., Introduction
to Plant Biochemistry, Oxford, Pergamon Press, 1983),
making ubiquinone an ideal internal control in these
analyses. Figure 6 shows the C8 HPLC analysis of lipid
soluble extracts from NFZ-Wt tissue and pds1 tissue.
In NFZ-Wt tissue (Figure 6A), peaks 3 and 4 were
identified as ubiquinone and plastoquinone,
respectively, based on retention time (26 and 27
minutes), optical spectra, and mass spectra (results
not shown). NFZ-Wt tissue contained a peak (1) with a
retention time of 13.5 minutes which was identified as
α-tocopherol based upon the retention time of a
standard. However, optical spectroscopy and mass
spectrometry demonstrated that peak 1 was composed of
two major components: α-tocopherol (1a) and an




unidentified compound (1b). The mass of α-tocopherol
was determined to be 430, as indicated by the presence
of the 431 protonated molecule, while the molecular
mass of the unidentified compound was 412, as
indicated by the presence of the 413 protonated
molecule (data not shown), clearly demonstrating the
presence of two compounds in peak 1. This quinone
analysis demonstrates that the herbicide NFZ, which
specifically inhibits the phytoene desaturase enzyme,
does not affect synthesis of homogentisate-derived
quinones. pds1 tissue (Figure 6B) contains ubiquinone
(peak 3) but lacks plastoquinone (peak 4).
Additionally, though pds1 contains a peak at 13.5
minutes, optical spectroscopy and mass spectrometry
data demonstrate that this peak lacks α-tocopherol
(1a) and is composed solely of the compound 1b (data
not shown). Therefore, homozygous pds1 plants
accumulate ubiquinone but lack both plastoquinone and
α-tocopherol. This is consistent with the pds1
mutation affecting pOHPP dioxygenase (refer to Figures
1 and 2), as suggested by the rescue of the mutation
by HGA, and provides additional evidence that the pds1
mutation disrupts pOHPP dioxygenase.
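
The relationship between the molecular masses and the protonated molecules reported for peak 1 can be checked with a short calculation. This sketch assumes the standard molecular formula of α-tocopherol, C29H50O2, which is not given in the text, and uses rounded monoisotopic atomic masses. No formula is assumed for the unidentified compound 1b.

```python
# Monoisotopic atomic masses (Da), rounded; the proton mass excludes the electron.
MONOISOTOPIC_MASS = {"C": 12.000000, "H": 1.007825, "O": 15.994915}
PROTON_MASS = 1.007276


def neutral_mass(composition):
    """Monoisotopic mass of a neutral molecule from its element counts."""
    return sum(MONOISOTOPIC_MASS[element] * count
               for element, count in composition.items())


def protonated_mz(mass):
    """m/z of the singly protonated molecule [M+H]+ in positive ion mode."""
    return mass + PROTON_MASS


alpha_tocopherol = {"C": 29, "H": 50, "O": 2}  # assumed formula, see lead-in
mass = neutral_mass(alpha_tocopherol)
print(f"alpha-tocopherol: M = {mass:.2f}, [M+H]+ = {protonated_mz(mass):.2f}")

# Compound 1b: only the observed protonated molecule (413) is known.
print(f"compound 1b: nominal M = {413 - 1}")
```

At unit resolution these values correspond to the nominal masses 430 and 431 cited for α-tocopherol.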

Isolation of a truncated, putative pOHPP dioxygenase
Arabidopsis cDNA

The observation of Schulz et al. demonstrating
that inhibitors of pOHPP dioxygenase activity disrupt
carotenoid synthesis and cause accumulation of
phytoene provided important insight for the
characterization of the pds1 mutant, which in turn
provided the present inventors with important insight
for the isolation of a putative cDNA for the pds1
locus. In animals, genetic defects which inhibit the
activity of pOHPP dioxygenase lead to tyrosinemia type
I, a fatal inherited disease in aromatic amino acid
catabolism characterized by the presence of high
levels of pOHPP in the urine.
In an effort to further understand the nature of
this disease, pOHPP dioxygenase cDNAs have been cloned
from several mammalian and bacterial sources
(summarized in Ruetschi et al., Eur. J. Biochem.
205:459-466, 1992). Amino acid identity between
various mammalian pOHPP dioxygenase enzymes is >80%;
in comparison, their identity to bacterial homologs is
very low, less than 28%. By using mammalian and
bacterial sequences to search the Expressed Sequence
Tags (ESTs) computer DNA database (Newman et al.,
Plant Physiol. 106:1241-1255, 1994), one partial
length Arabidopsis EST was identified and used as a
probe. The partial length Arabidopsis probe
corresponds to base pairs 1072 through 1500 of SEQ ID
NO:1.

This cDNA contained only 99 amino acids of the
carboxyl terminal portion of the protein coding
region. The deduced protein sequence of this putative
Arabidopsis pOHPP dioxygenase cDNA shows similar
homology (~50% identity) to both the mammalian and
bacterial pOHPP dioxygenases. Interestingly, the
partial Arabidopsis sequence also contains a 15 amino
acid insertion not found in the human or bacterial
enzymes. Finally, alignment of six pOHPP dioxygenase
sequences from mammals and bacteria identified three
regions of high conservation, the highest being a 16
amino acid region near the carboxy end of pOHPP
dioxygenases that shows 62.5% identity across all
phyla. Ruetschi et al., Eur. J. Biochem. 205:459-466,
1992. This region is also present in the truncated
Arabidopsis sequence. These lines of evidence suggest
that the partial length Arabidopsis cDNA described
above encodes a pOHPP dioxygenase, most likely the
pds1 locus.
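
Percent identity figures such as those quoted above are computed over aligned positions. The following sketch shows a pairwise version of that calculation; it is a simplification, since the 62.5% figure refers to positions conserved across six sequences. The two 16-residue strings are hypothetical placeholders, not sequences from any pOHPP dioxygenase.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of aligned, non-gap positions at which two sequences match."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = [(a, b) for a, b in zip(aligned_a, aligned_b)
                if a != gap and b != gap]
    if not compared:
        return 0.0
    matches = sum(1 for a, b in compared if a == b)
    return 100.0 * matches / len(compared)


# Hypothetical 16-residue region: 10 identical positions out of 16.
region_a = "ACDEFGHIKLMNPQRS"
region_b = "ACDEFGHIKLWWWWWW"
print(f"{percent_identity(region_a, region_b):.1f}% identity")
```

For a 16 amino acid region, 62.5% identity corresponds to 10 identical positions.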
Isolation and Characterization of a full length
Arabidopsis pOHPP dioxygenase cDNA

Utilizing the partial length Arabidopsis cDNA
probe, an Arabidopsis cDNA library was screened by
nucleic acid hybridization for full length cDNAs. A
large number of hybridizing cDNAs were isolated, and
one of the longest, pHPP1.5, containing a 1,520 bp
insert, was sequenced completely; the insert is
presented as SEQ ID NO:1. pHPP1.5 encodes a 446
amino acid protein (presented as SEQ ID NO:2), which
is slightly larger in size than mammalian and
bacterial pOHPP dioxygenases. pHPP1.5 shows 34-40%
identity at the amino acid level to pOHPP dioxygenases
from various mammals and bacteria. In comparing four
bacterial pOHPP dioxygenases and one mammalian pOHPP
dioxygenase (pig), which ranged in size from 346-404
amino acids, Denoya et al. identified 69 amino acids
that were conserved between all five pOHPP
dioxygenases. Denoya et al., J. Bacteriol. 176:5312-
5319, 1994. The pHPP1.5 coding region contains 52 of
these 69 conserved amino acids.
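
The figures in this paragraph can be cross-checked against the CDS feature given in the sequence listing for SEQ ID NO:1 (location 37..1374). The sketch below is arithmetic only, with hypothetical function names. It assumes the CDS location includes the terminating TGA codon, as the listing appears to show.

```python
def cds_codon_count(start, end):
    """Number of codons in a CDS with inclusive 1-based coordinates."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length // 3


codons = cds_codon_count(37, 1374)
print(f"CDS 37..1374 spans {codons} codons, counting the stop codon if included")

# Share of the residues conserved in the Denoya et al. comparison
# that are also present in pHPP1.5.
conserved_percent = 100.0 * 52 / 69
print(f"52 of 69 conserved residues = {conserved_percent:.1f}%")
```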

Demonstration that pHPP1.5 encodes an active pOHPP
dioxygenase protein and complements the pds1 mutation

In order to definitively demonstrate that pHPP1.5
is the gene product encoded by the pds1 locus and that
it encodes a functional pOHPP dioxygenase protein, the
pHPP1.5 cDNA was cloned into a plant transformation
vector for molecular complementation experiments with
the pds1 mutant. The full length wild-type pOHPP dioxygenase
cDNA will be subcloned into a plant transformation
vector driven by the Cauliflower Mosaic Virus 35S
(CaMV) promoter and containing all necessary
termination cassettes and selectable markers (KanR).
The CaMV promoter is a strong constitutive promoter.
This single construct and the vector without the
pHPP1.5 insert (as a control) will be used in vacuum
infiltration transformation, which uses whole soil
grown plants and will be done on plants that are
heterozygous for the pds1 mutation. Bouchez et al., CR
Acad. Sci. Paris, Sciences de la vie 316, 1993.

In the standard procedure, 20-30 soil grown
plants will be independently transformed and analyzed
separately. In this case homozygous plants containing
the pds1 mutation would be lethal while heterozygous
plants containing the pds1 mutation would be
segregating 2:1 for the pds1 mutation in their
siliques. The inventors will use a similar number of
wild type plants in a parallel transformation as a
control. After transformation of the pds1 segregating
plant population, as the plants are setting seed the
inventors can easily identify those heterozygous for
the pds1 mutation in retrospect by inspection of their
siliques, which would contain green:white embryos in a
3:1 ratio.
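
The expected ratios for a selfed heterozygote can be enumerated directly. In this sketch "P" and "p" are hypothetical labels for the wild-type and pds1 alleles. Reading the 2:1 figure as the ratio of heterozygous to wild-type plants among surviving green progeny is one interpretation, not something the text states.

```python
from collections import Counter
from itertools import product


def self_cross(alleles=("P", "p")):
    """Genotype counts from selfing a plant carrying the two given alleles."""
    return Counter("".join(sorted(pair)) for pair in product(alleles, repeat=2))


offspring = self_cross()
green = offspring["PP"] + offspring["Pp"]   # wild-type or heterozygous embryos
white = offspring["pp"]                     # homozygous mutant (albino) embryos

print(f"genotypes PP:Pp:pp = {offspring['PP']}:{offspring['Pp']}:{offspring['pp']}")
print(f"green:white embryos = {green}:{white}")
print(f"heterozygous:wild-type among green = {offspring['Pp']}:{offspring['PP']}")
```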

Seed harvested from individually transformed
heterozygous pds1 plants will be germinated on
kanamycin and resistant seedlings transferred to soil.
Segregation analysis of seed from these primary
transformants (T2 seed) and T3 seed for segregation of
the pds1 phenotype (albino and phytoene accumulating)
and the T-DNA encoded kanamycin resistance marker
(wild type pOHPP dioxygenase cDNA) will conclusively
demonstrate complementation of the pds1 mutation with
the pOHPP dioxygenase cDNA. To provide additional
proof that pHPP1.5 is encoded by the pds1 locus, the
pHPP1.5 cDNA has been mapped relative to the pds1
locus using recombinant inbred lines, as described in
Lister et al., Plant J. 4:745-750, 1993. The pHPP1.5
cDNA mapped to the region of chromosome 1 containing
the pds1 mutation (Figure 4). Finally, the pHPP1.5
cDNA will be overexpressed in E. coli and the activity
of the protein determined.
Modification of pOHPP Expression

From the genetic and biochemical studies
described above it is clear that only one pOHPP
dioxygenase gene product is involved in chloroplastic
quinone synthesis, that the pds1 mutation defines this
gene, that the pHPP1.5 cDNA is the product encoded by
the pds1 locus, and that disruption of its function
completely eliminates Vitamin E production and
plastoquinone and carotenoid synthesis in plant
tissues. Modification of pOHPP dioxygenase expression
in plants by molecular techniques using pHPP1.5 can
therefore be used to positively or negatively affect
the production of tocopherols and plastoquinones directly
and carotenoids indirectly (refer to Figures 1 and 2).
Specifically, overexpression of the pOHPP dioxygenase
enzyme will result in increased levels of one or more
of these compounds in the tissues of transgenic
plants. Alternatively, using antisense techniques, it
is possible to lower the level of enzyme activity to
decrease the levels of these compounds in plants.
Additionally, overexpression of the pOHPP dioxygenase
will enable a transgenic plant to withstand elevated
levels of herbicides that target this enzyme,
providing agronomically significant herbicide
resistance relative to normal plants.

Two different plant systems, Arabidopsis and
tomato, are being used to demonstrate the effects of
modified pOHPP dioxygenase in plant tissues.
Constitutive overexpression of pOHPP dioxygenase will
be done in both plant systems utilizing the CaMV 35S
promoter and the pHPP1.5 cDNA. The consequences of
this altered expression on tocopherol, plastoquinone
and carotenoid levels and profiles in various plant
tissues will be determined as described below. In
tomato, tissue specific overexpression of pOHPP
dioxygenase (pHPP1.5) will be driven by the fruit
specific promoter derived from the tomato β-subunit
gene, which is expressed specifically in developing,
but not ripening, tomato fruit. This will determine
the potential for modifying the levels of tocopherol,
plastoquinone and carotenoids specifically in
developing and ripening fruit for nutritional purposes
without affecting their production in other plant
tissues. These combined experiments will determine
whether pOHPP dioxygenase is a rate limiting step in
chloroplastic homogentisic acid derived quinone
synthesis and the potential for manipulating
chloroplastic homogentisic acid derived quinones
(tocopherols and plastoquinones) and compounds that
require quinones for their synthesis (carotenoids,
etc.) by increasing pOHPP dioxygenase activity.

Multiple independent transformants will be
produced for each construct and plant species used.
The integration and gene copy number of each chimeric
gene in each line will be confirmed by Southern
analysis, the level of pOHPP mRNA determined by
Northern blot analysis, pOHPP dioxygenase activity
determined as described in Schulz et al., FEBS
318:162-166, 1993, and the effects on individual
chloroplastic components of interest analyzed
(tocopherols, plastoquinones and carotenoids). In
green tissue containing constitutively expressing
constructs this analysis can occur relatively soon
after transformants are put into soil. Analysis of
fruit specific construct lines will require much more
time for fruit set to occur. Analysis of tocopherols,
plastoquinones and carotenoids will be by a
combination of HPLC, optical and mass spectra as
described in Norris et al. (1995, in press). Analysis
of tocopherol levels is performed by HPLC and, when
needed, by GC:mass spectroscopy in selected ion mode.
In MS analysis the absolute level of tocopherol will
be quantified by isotopic dilution with a known,
"heavy carbon" tocopherol standard added at the start
of the extraction. Determination based on fresh
weight of tissue can also be performed. Plastoquinone
levels will be quantified by C8 HPLC and optical
spectra as described in Norris et al. (1995, in
press). Total carotenoid levels are determined
spectrophotometrically and the levels of individual
carotenes quantified by C18 HPLC and optical spectra
quantified relative to standards. In the course of these
experiments the inventors will identify high expressing lines
with simple insertions that segregate as single
genetic loci in progeny. This will facilitate
analysis of the inheritance of the gene and phenotype
in future generations.
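
Quantification by isotopic dilution, as mentioned above, reduces to a ratio of signals. The sketch below assumes that the labeled standard and the native tocopherol give equal detector response per unit amount. All numeric values and function names are hypothetical.

```python
def amount_by_isotope_dilution(area_analyte, area_standard, standard_added_ng):
    """Analyte amount from the peak-area ratio to a co-extracted labeled standard.

    Assumes equal response factors for the native and labeled compounds.
    """
    if area_standard <= 0:
        raise ValueError("standard peak area must be positive")
    return standard_added_ng * area_analyte / area_standard


def per_fresh_weight(amount_ng, fresh_weight_mg):
    """Express an amount per mg fresh weight of extracted tissue."""
    return amount_ng / fresh_weight_mg


# Hypothetical run: 50 ng of labeled standard added to 10 mg of tissue.
tocopherol_ng = amount_by_isotope_dilution(2000.0, 1000.0, 50.0)
print(f"{tocopherol_ng:.1f} ng total, "
      f"{per_fresh_weight(tocopherol_ng, 10.0):.1f} ng per mg fresh weight")
```

Because the standard is added at the start of the extraction, losses during extraction affect both compounds equally and cancel in the ratio.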

Overexpression of pOHPP for in vitro Herbicide
Analysis

pOHPP dioxygenase will be overexpressed in E.
coli or other prokaryotic or eukaryotic protein
production systems and purified in large amounts for
use in enzymatic assays for identifying new herbicide
compounds (pOHPP inhibitors) and optimizing existing
chemistries through detailed kinetic analysis.
(i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 1519 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: double
    (D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vi) ORIGINAL SOURCE:
    (A) ORGANISM: Arabidopsis thaliana

(vii) IMMEDIATE SOURCE:
    (B) CLONE: pHPP1.5

(ix) FEATURE:
    (A) NAME/KEY: CDS
    (B) LOCATION: 37..1374

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

CCACGCGTCC GAGTTTTAGC AGAGTTGGTG AAATCA ATG GGC CAC CAA AAC GCC  54
                                        Met Gly His Gln Asn Ala
                                          1               5

GCC GTT TCA GAG AAT CAA AAC CAT GAT GAC GGC GCT GCG TCG TCG CCG  102
Ala Val Ser Glu Asn Gln Asn His Asp Asp Gly Ala Ala Ser Ser Pro
             10                  15                  20

GGA TTC AAG CTC GTC GGA TTT TCC AAG TTC GTA AGA AAG AAT CCA AAG  150
Gly Phe Lys Leu Val Gly Phe Ser Lys Phe Val Arg Lys Asn Pro Lys
         25                  30                  35

TCT GAT AAA TTC AAG GTT AAG CGC TTC CAT CAC ATC GAG TTC TGG TGC  198
Ser Asp Lys Phe Lys Val Lys Arg Phe His His Ile Glu Phe Trp Cys
     40                  45                  50

GGC GAC GCA ACC AAC GTC GCT CGT CGC TTC TCC TGG GGT CTG GGG ATG  246
Gly Asp Ala Thr Asn Val Ala Arg Arg Phe Ser Trp Gly Leu Gly Met
 55                  60                  65                  70

AGA TTC TCC GCC AAA TCC GAT CTT TCC ACC GGA AAC ATG GTT CAC GCC  294
Arg Phe Ser Ala Lys Ser Asp Leu Ser Thr Gly Asn Met Val His Ala
                 75                  80                  85

TCT TAC CTA CTC ACC TCC GGT GAC CTC CGA TTC CTT TTC ACT GCT CCT  342
Ser Tyr Leu Leu Thr Ser Gly Asp Leu Arg Phe Leu Phe Thr Ala Pro
             90                  95                 100

TAC TCT CCG TCT CTC TCC GCC GGA GAG ATT AAA CCG ACA ACC ACA GCT  390
Tyr Ser Pro Ser Leu Ser Ala Gly Glu Ile Lys Pro Thr Thr Thr Ala
        105                 110                 115

TCT ATC CCA AGT TTC GAT CAC GGC TCT TGT CGT TCC TTC TTC TCT TCA  438
Ser Ile Pro Ser Phe Asp His Gly Ser Cys Arg Ser Phe Phe Ser Ser
    120                 125                 130

CAT GGT CTC GGT GTT AGA GCC GTT GCG ATT GAA GTA GAA GAC GCA GAG  486
His Gly Leu Gly Val Arg Ala Val Ala Ile Glu Val Glu Asp Ala Glu
135                 140                 145                 150

TCA GCT TTC TCC ATC AGT GTA GCT AAT GGC GCT ATT CCT TCG TCG CCT  534
Ser Ala Phe Ser Ile Ser Val Ala Asn Gly Ala Ile Pro Ser Ser Pro
                155                 160                 165

CCT ATC GTC CTC AAT GAA GCA GTT ACG ATC GCT GAG GTT AAA CTA TAC  582
Pro Ile Val Leu Asn Glu Ala Val Thr Ile Ala Glu Val Lys Leu Tyr
            170                 175                 180

GGC GAT GTT GTT CTC CGA TAT GTT AGT TAC AAA GCA GAA GAT ACC GAA  630
Gly Asp Val Val Leu Arg Tyr Val Ser Tyr Lys Ala Glu Asp Thr Glu
        185                 190                 195

AAA TCC GAA TTC TTG CCA GGG TTC GAG CGT GTA GAG GAT GCG TCG TCG  678
Lys Ser Glu Phe Leu Pro Gly Phe Glu Arg Val Glu Asp Ala Ser Ser
    200                 205                 210

TTC CCA TTG GAT TAT GGT ATC CGG CGG CTT GAC CAC GCC GTG GGA AAC  726
Phe Pro Leu Asp Tyr Gly Ile Arg Arg Leu Asp His Ala Val Gly Asn
215                 220                 225                 230

GTT CCT GAG CTT GGT CCG GCT TTA ACT TAT GTA GCG GGG TTC ACT GGT  774
Val Pro Glu Leu Gly Pro Ala Leu Thr Tyr Val Ala Gly Phe Thr Gly
                235                 240                 245

TTT CAC CAA TTC GCA GAG TTC ACA GCA GAC GAC GTT GGA ACC GCC GAG  822
Phe His Gln Phe Ala Glu Phe Thr Ala Asp Asp Val Gly Thr Ala Glu
            250                 255                 260

AGC GGT TTA AAT TCA GCG GTC CTG GCT AGC AAT GAT GAA ATG GTT CTT  870
Ser Gly Leu Asn Ser Ala Val Leu Ala Ser Asn Asp Glu Met Val Leu
        265                 270                 275

CTA CCG ATT AAC GAG CCA GTG CAC GGA ACA AAG AGG AAG AGT CAG ATT  918
Leu Pro Ile Asn Glu Pro Val His Gly Thr Lys Arg Lys Ser Gln Ile
    280                 285                 290

CAG ACG TAT TTG GAA CAT AAC GAA GGC GCA GGG CTA CAA CAT CTG GCT  966
Gln Thr Tyr Leu Glu His Asn Glu Gly Ala Gly Leu Gln His Leu Ala
295                 300                 305                 310

CTG ATG AGT GAA GAC ATA TTC AGG ACC CTG AGA GAG ATG AGG AAG AGG  1014
Leu Met Ser Glu Asp Ile Phe Arg Thr Leu Arg Glu Met Arg Lys Arg
                315                 320                 325

AGC AGT ATT GGA GGA TTC GAC TTC ATG CCT TCT CCT CCG CCT ACT TAC  1062
Ser Ser Ile Gly Gly Phe Asp Phe Met Pro Ser Pro Pro Pro Thr Tyr
            330                 335                 340

TAC CAG AAT CTC AAG AAA CGG GTC GGC GAC GTG CTC AGC GAT GAT CAG  1110
Tyr Gln Asn Leu Lys Lys Arg Val Gly Asp Val Leu Ser Asp Asp Gln
        345                 350                 355

ATC AAG GAG TGT GAG GAA TTA GGG ATT CTT GTA GAC AGA GAT GAT CAA  1158
Ile Lys Glu Cys Glu Glu Leu Gly Ile Leu Val Asp Arg Asp Asp Gln
    360                 365                 370

GGG ACG TTG CTT CAA ATC TTC ACA AAA CCA CTA GGT GAC AGG CCG ACG  1206
Gly Thr Leu Leu Gln Ile Phe Thr Lys Pro Leu Gly Asp Arg Pro Thr
375                 380                 385                 390

ATA TTT ATA GAG ATA ATC CAG AGA GTA GGA TGC ATG ATG AAA GAT GAG  1254
Ile Phe Ile Glu Ile Ile Gln Arg Val Gly Cys Met Met Lys Asp Glu
                395                 400                 405

GAA GGG AAG GCT TAC CAG AGT GGA GGA TGT GGT GGT TTT GGC AAA GGC  1302
Glu Gly Lys Ala Tyr Gln Ser Gly Gly Cys Gly Gly Phe Gly Lys Gly
            410                 415                 420

AAT TTC TCT GAG CTC TTC AAG TCC ATT GAA GAA TAC GAA AAG ACT CTT  1350
Asn Phe Ser Glu Leu Phe Lys Ser Ile Glu Glu Tyr Glu Lys Thr Leu
        425                 430                 435

GAA GCC AAA CAG TTA GTG GGA TGA ACAAGAAGAA GAACCAACTA AAGGATTGTG  1404
Glu Ala Lys Gln Leu Val Gly *
    440                 445

TAATTAATGT AAAACTGTTT TATCTTATCA AAACAATGTT ATACAACATC TCATTTAAAA  1464

ACGAGATCAA TCAAAAAATA CAATCTTAAA TTCAAAACCA AAAAAAAAAA AAAAA  1519

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 446 amino acids
    (B) TYPE: amino acid
    (D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

Met Gly His Gln Asn Ala Ala Val Ser Glu Asn Gln Asn His Asp Asp
  1               5                  10                  15

Gly Ala Ala Ser Ser Pro Gly Phe Lys Leu Val Gly Phe Ser Lys Phe
             20                  25                  30

Val Arg Lys Asn Pro Lys Ser Asp Lys Phe Lys Val Lys Arg Phe His
         35                  40                  45

His Ile Glu Phe Trp Cys Gly Asp Ala Thr Asn Val Ala Arg Arg Phe
     50                  55                  60

Ser Trp Gly Leu Gly Met Arg Phe Ser Ala Lys Ser Asp Leu Ser Thr
 65                  70                  75                  80

Gly Asn Met Val His Ala Ser Tyr Leu Leu Thr Ser Gly Asp Leu Arg
                 85                  90                  95

Phe Leu Phe Thr Ala Pro Tyr Ser Pro Ser Leu Ser Ala Gly Glu Ile
            100                 105                 110

Lys Pro Thr Thr Thr Ala Ser Ile Pro Ser Phe Asp His Gly Ser Cys
        115                 120                 125

Arg Ser Phe Phe Ser Ser His Gly Leu Gly Val Arg Ala Val Ala Ile
    130                 135                 140

Glu Val Glu Asp Ala Glu Ser Ala Phe Ser Ile Ser Val Ala Asn Gly
145                 150                 155                 160

Ala Ile Pro Ser Ser Pro Pro Ile Val Leu Asn Glu Ala Val Thr Ile
                165                 170                 175

Ala Glu Val Lys Leu Tyr Gly Asp Val Val Leu Arg Tyr Val Ser Tyr
            180                 185                 190

Lys Ala Glu Asp Thr Glu Lys Ser Glu Phe Leu Pro Gly Phe Glu Arg
        195                 200                 205

Val Glu Asp Ala Ser Ser Phe Pro Leu Asp Tyr Gly Ile Arg Arg Leu
    210                 215                 220

Asp His Ala Val Gly Asn Val Pro Glu Leu Gly Pro Ala Leu Thr Tyr
225                 230                 235                 240

Val Ala Gly Phe Thr Gly Phe His Gln Phe Ala Glu Phe Thr Ala Asp
                245                 250                 255

Asp Val Gly Thr Ala Glu Ser Gly Leu Asn Ser Ala Val Leu Ala Ser
            260                 265                 270

Asn Asp Glu Met Val Leu Leu Pro Ile Asn Glu Pro Val His Gly Thr
        275                 280                 285

Lys Arg Lys Ser Gln Ile Gln Thr Tyr Leu Glu His Asn Glu Gly Ala
    290                 295                 300

Gly Leu Gln His Leu Ala Leu Met Ser Glu Asp Ile Phe Arg Thr Leu
305                 310                 315                 320

Arg Glu Met Arg Lys Arg Ser Ser Ile Gly Gly Phe Asp Phe Met Pro
                325                 330                 335

Ser Pro Pro Pro Thr Tyr Tyr Gln Asn Leu Lys Lys Arg Val Gly Asp
            340                 345                 350

Val Leu Ser Asp Asp Gln Ile Lys Glu Cys Glu Glu Leu Gly Ile Leu
        355                 360                 365

Val Asp Arg Asp Asp Gln Gly Thr Leu Leu Gln Ile Phe Thr Lys Pro
    370                 375                 380

Leu Gly Asp Arg Pro Thr Ile Phe Ile Glu Ile Ile Gln Arg Val Gly
385                 390                 395                 400

Cys Met Met Lys Asp Glu Glu Gly Lys Ala Tyr Gln Ser Gly Gly Cys
                405                 410                 415

Gly Gly Phe Gly Lys Gly Asn Phe Ser Glu Leu Phe Lys Ser Ile Glu
            420                 425                 430

Glu Tyr Glu Lys Thr Leu Glu Ala Lys Gln Leu Val Gly *
        435                 440                 445
CLAIMS

I claim: 



1. A biologically pure sample of DNA, the DNA
comprising a DNA sequence coding for the expression of
a plant p-hydroxyphenyl pyruvic acid dioxygenase.

2. A vector containing the DNA sequence of
claim 1.

3. A microbial host transformed by the vector
of claim 2.

4. The DNA of claim 1, wherein the p-hydroxyphenyl
pyruvic acid dioxygenase is from
Arabidopsis thaliana.

5. A transgenic tomato plant transformed with a
DNA construct including the DNA of claim 1.

6. A transgenic Arabidopsis plant transformed
with a DNA construct including the DNA of claim 1.

7. The biologically pure DNA of claim 1 wherein
the DNA is SEQ ID NO:2.

8. A DNA plant gene expression construct
comprising:

a. a DNA sequence coding for the
expression of a plant p-hydroxyphenyl pyruvic acid
dioxygenase;

b. a promoter effective in plant cells
located 5' to the DNA coding sequence; and

c. a 3' termination sequence effective in
plant cells.

9. A DNA construct comprising:

a. a promoter capable of expressing a
downstream coding sequence in a tomato plant;

b. a DNA sequence coding for the
expression of a p-hydroxyphenyl pyruvic acid
dioxygenase of plant origin; and

c. a 3' termination sequence, the
construction capable of expressing a p-hydroxyphenyl
pyruvic acid dioxygenase gene when transformed into
tomato plants.

10. A bacteria containing the construction of
Claim 9.

11. A tomato plant cell containing the
construction of Claim 9.

12. An Arabidopsis plant cell containing the
construction of Claim 9.

13. A transgenic tomato plant comprising in its
genome a foreign genetic construction comprising, 5'
to 3', a promoter effective in tomato, a DNA coding
region encoding p-hydroxyphenyl pyruvic acid
dioxygenase, and a transcriptional terminator, the
genetic construction effective in vivo in tomato
plants to stimulate expression of p-hydroxyphenyl
pyruvic acid dioxygenase.

14. Seed of the tomato plant of claim 13.

15. Fruit of the tomato plant of claim 13.

16. The transgenic tomato plant of claim 11
wherein the DNA coding region encoding p-hydroxyphenyl
pyruvic acid dioxygenase is that set forth in SEQ ID
NO:1.
17. A method of suppressing the production of
vitamin E and plastoquinones in a plant, the method
comprising the steps of:

a. isolating a DNA sequence encoding a p-hydroxyphenyl
pyruvic acid dioxygenase of plant
origin;

b. creating a genetic construction including,
5' to 3', a promoter effective in the plant's cells, a
coding sequence, and a transcriptional terminator, the
coding region being derived from the DNA sequence,
wherein the DNA sequence from step (a) has been
altered so that expression of p-hydroxyphenyl pyruvic
acid dioxygenase is suppressed; and

c. transforming a cell of the plant with the
genetic construction, whereby the plant's cell
produces lowered levels of vitamin E and
plastoquinones.
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Field Of The Invention 
The present invention relates to a molecular 
approach for modifying the synthesis of vitamin E, 
plastoquinone, and carotenoids in plants by use of a 
full-length cloned cDNA which encodes a p- 
15 hydroxyphenyl pyruvic acid dioxygenase enzyme. 

Background Of The Invention

The chloroplasts of higher plants contain many
unique, interconnected biochemical pathways that
produce an array of secondary metabolite compounds
which not only perform vital functions within the
plant but are also important from agricultural and
nutritional perspectives. Three such secondary
metabolites are the lipid soluble, chloroplastically
synthesized compounds vitamin E (α-tocopherol or
α-toc), plastoquinones (PQ), and carotenoids, which
together perform many crucial biochemical functions in
the chloroplast. PQ and vitamin E are quinone
compounds synthesized by a common pathway in the
plastid; carotenoids are tetraterpenoids synthesized
by a separate plastid-localized pathway.

Plastoquinone (PQ) often accounts for up to 50%
of the total plastidic quinone pool in green tissues.
The primary function of PQ is as a fundamental
component of the photosynthetic electron transport
chain, acting as an electron carrier between
photosystem II and the cytochrome b6f complex. PQ
likely has other less well studied functions in
plastids, namely in acting as a direct or intermediate
electron carrier for a variety of other biosynthetic
reactions in the chloroplast.

Vitamin E is the second major class of
chloroplastic quinones, accounting for up to 40% of
the quinone pool in plastids. The essential
nutritional value of tocopherols was recognized around
1925, and the compound responsible for Vitamin E
activity was first identified as α-tocopherol in 1936.
α-Toc has a well-documented role in mammals as an
antioxidant, and a similar, though less well
understood, antioxidant role in plants. Liebler, et
al., Toxicology 23:147-169, 1993; Hess, Anti-oxidants
in Higher Plants, CRC Press: 111-134, 1993.

Carotenoids are a separate, diverse group of
lipophilic pigments synthesized in plants, fungi, and
bacteria. In photosynthetic tissues, carotenoids
function as accessory pigments in light harvesting and
play important roles in photo-protection by quenching
free radicals, singlet oxygen, and other reactive
species. Siefermann-Harms, Physiol. Plantarum
69:561-568, 1987. In the plastids of non-photosynthetic
tissues, high levels of carotenoids often accumulate,
providing the intense orange, yellow, and red
coloration of many fruits, vegetables, and flowers
(Pfander, Methods in Enzym., 213A, 3-13, 1992). In
addition to their many functions in plants,
carotenoids and their metabolites also have important
functions in animals, where they serve as the major
source of Vitamin A (retinol), and have been
identified as providing protection from some forms of
cancer due to their antioxidant activities. Vitamin
E's antioxidant activities are also thought to protect
against some forms of cancer, and may act
synergistically with carotenoids in this regard.
Liebler, et al., Toxicology 23:147-169, 1993; Krinsky,
J. Nutr. 119:123-126, 1989.

Tocopherol and Plastoquinone Synthesis

α-Tocopherol and plastoquinone are the most
abundant quinones in the plastid and are synthesized
by the common pathway shown in Figure 1. The
precursor molecule for both compounds, homogentisic
acid (HGA), is produced in the chloroplast from the
shikimic acid pathway intermediate p-hydroxyphenyl
pyruvic acid (pOHPP), in an oxidation/decarboxylation
reaction catalyzed by the enzyme p-hydroxyphenyl
pyruvic acid dioxygenase (pOHPP dioxygenase).
Homogentisic acid is subject to
phytylation/prenylation (phytylpyrophosphate and
solanylpyrophosphate, C20 and C45, respectively) coupled
to a simultaneous decarboxylation by a phytyl/prenyl
transferase to form the first true tocopherol and
plastoquinone intermediates,
2-demethylphytylplastoquinol and
2-demethylplastoquinol-9, respectively. A single ring
methylation occurs on 2-demethylplastoquinol to yield
plastoquinol-9 (PQH2) which is then oxidized to
plastoquinone-9 (PQ). This oxidation is reversible
and is the basis of electron transport by
plastoquinone in the chloroplast.

The preferred route, as established in spinach,
for α-tocopherol formation from
2-demethylphytylplastoquinol appears to be 1) ring
methylation of the intermediate,
2-α-demethylphytylplastoquinol, to yield
phytylplastoquinol, 2) cyclization to yield
d-tocopherol and, finally, 3) a second ring
methylation to yield α-tocopherol. Ring methylation
in both tocopherol and plastoquinone synthesis is
carried out by a single enzyme that is specific for
the site of methylation on the ring, but has
relatively broad substrate specificity and
accommodates both classes of quinone compounds. This
methylation enzyme is the only enzyme of the pathway
that has been purified from plants to date.
d'Harlingue, et al., J. Biol. Chem. 26:15200, 1985. All
enzymatic activities of the α-toc/PQ pathway have been
localized to the inner chloroplast envelope by cell
fractionation studies except for pOHPP dioxygenase and
the tocopherol cyclase enzyme. Difficulties with cell
fractionation methods, low activities for some of the
enzymes, substrate stability and availability, and
assay problems make studying the pathway
biochemically difficult.

Vitamin E and PQ levels, ratios, and total
amounts vary by orders of magnitude in different
plants, tissues and developmental stages. Such
variations indicate that the vitamin E and PQ pathway
is both highly regulated and has the potential for
manipulation to modify the absolute levels and ratios
of the two end products. The pathway in Figure 1
makes it clear that production of homogentisic acid by
pOHPP dioxygenase is likely to be a key regulatory
point for bulk flow through the pathway, both because
HGA production is the first committed step in α-toc/PQ
synthesis, and also because the reaction is
essentially irreversible. Therefore modifying the
levels of HGA by modifying pOHPP dioxygenase activity
should have a direct impact on the total α-toc/PQ
biosynthetic accumulation in plant tissues, and, as
described below, because of the connection of PQ and
carotenoid synthesis, should also affect carotenoid
synthesis in plant tissues.



Carotenoid Biosynthesis; Quinones as Electron Carriers

In plants, carotenoids are synthesized and
accumulate exclusively in plastids via the pathway
shown on the left-hand side of Figure 1. The first
committed step in carotenoid synthesis is the
condensation of two molecules of the C20 hydrocarbon
geranylgeranyl pyrophosphate (GGDP) by the enzyme
phytoene synthase, to form the colorless C40
hydrocarbon, phytoene. In oxygenic photosynthetic
organisms (e.g. plants, algae, and cyanobacteria),
phytoene undergoes two sequential desaturation
reactions, catalyzed by phytoene desaturase, to
produce ζ-carotene through the intermediate
phytofluene. Subsequently, ζ-carotene undergoes two
further desaturations, catalyzed by ζ-carotene
desaturase, to yield the red pigment lycopene.
Lycopene is cyclized to produce either α-carotene or
β-carotene, both of which are subject to various
hydroxylation and epoxidation reactions to yield the
carotenoids and xanthophylls most abundant in
photosynthetic tissues of plants: lutein, β-carotene,
violaxanthin and neoxanthin.

The genes encoding the first two enzymes of the
carotenoid pathway (phytoene synthase and phytoene
desaturase) have been isolated and studied from a
number of plant and bacterial sources in recent years.
Sandmann, Eur. J. Biochem. 223:7-24, 1994. Phytoene
desaturase has been the most intensively studied, both
because it is a target for numerous commercially
important herbicides, and also because the phytoene
desaturation reaction is thought to be a rate limiting
step in carotenoid synthesis. Molecular and
biochemical studies suggest that two types of phytoene
desaturase enzymes have evolved by independent
evolution: the crtI-type found in anoxygenic
photosynthetic organisms (e.g. Rhodobacter and
Erwinia), and the pds-type found in oxygenic
photosynthetic organisms. Despite their differences
in primary amino acid sequence, all phytoene
desaturase enzymes contain a dinucleotide binding
domain (FAD or NAD/NADP), which in Capsicum annuum has
been shown to be FAD. Hugueney et al., Eur. J.
Biochem. 209:399-407, 1992. Presumably, the bound
dinucleotide in both types of phytoene desaturase
enzymes is reduced during desaturation and reoxidized
by an unknown reductant present in the plastid or
bacterium.

Several lines of evidence have suggested a role
for quinones in the phytoene desaturation reaction in
higher plants. Using isolated daffodil chromoplasts,
Mayer and co-workers demonstrated that in an anaerobic
environment, oxidized artificial quinones were
required for the desaturation of phytoene while
reduced quinones were ineffective. Mayer et al., Eur.
J. Biochem. 191:359-363, 1990. Further supporting
evidence comes from studies with the triketone class
of herbicides (e.g. Sulcotrione), which cause phytoene
accumulation in treated tissues but, unlike the
well-studied pyridazone class (e.g. Norflurazon (NFZ)), do
not directly affect the phytoene desaturase enzyme.
Rather, triketone herbicides competitively inhibit
pOHPP dioxygenase, an enzyme common to the synthesis
of both plastoquinone and tocopherols, suggesting that
one or more classes of quinones may play a role in
carotenoid desaturation reactions. Schulz et al.,
FEBS 318:162-166, 1993; Secor, Plant Physiol.
106:1429-1433; Beyer et al., IUPAC Pure and Applied
Chemistry 66:1047-1056, 1994.

Despite the well-studied, widespread importance
of vitamin E, plastoquinone, and carotenoids to human
nutrition, agriculture, and biochemical processes
within plant cells, much remains unclear about their
biosynthesis and accumulation in plant tissues. This
uncertainty has in turn limited the potential for
manipulation of the synthesis and levels of these
important compounds in plants.

Summary of the Invention

In one embodiment, this invention provides a
biologically pure sample of DNA which DNA comprises a
sequence coding for the expression of Arabidopsis
thaliana p-hydroxyphenyl pyruvic acid dioxygenase.

In other embodiments, this invention provides a
vector and microbial host containing a DNA sequence
sufficiently homologous to SEQ ID NO:1 so as to code
for the expression of Arabidopsis thaliana
p-hydroxyphenyl pyruvic acid dioxygenase, and a genetic
construct containing a DNA sequence sufficiently
homologous to SEQ ID NO:1 so as to code for the
expression of Arabidopsis thaliana p-hydroxyphenyl
pyruvic acid dioxygenase, together with a promoter
located 5' to the DNA coding sequence and a 3'
termination sequence.

In another embodiment, this invention provides a
method of creating a transgenic plant in which the
levels of the pOHPP dioxygenase enzyme are elevated
sufficiently that production of plastoquinones,
vitamin E, and carotenoids is modified.

It is an object of the present invention to
genetically engineer higher plants to modify the
production of plastoquinones, vitamin E, and
carotenoids.

It is another object of the invention to provide
transgenic plants that express elevated levels
of the pOHPP dioxygenase enzyme and therefore have
elevated resistance to the triketone class
of herbicides (e.g. sulcotrione).

It is another object of the present invention to
provide a method for the preparation of the enzyme
p-hydroxyphenyl pyruvic acid dioxygenase (pOHPP
dioxygenase), an enzyme which can be used to identify
new pOHPP dioxygenase-inhibiting herbicides.

Other features and advantages of the invention
will be apparent from the following description of the
preferred embodiments thereof and from the claims.

Brief Description of the Drawings

Fig. 1 is a diagram of the pathways for synthesis
of carotenoids, vitamin E (tocopherol), and
plastoquinone.

Fig. 2 is a diagram of the interconnections of
the pathways illustrated in Fig. 1.

Figs. 3A-3E are graphs of pigment analyses of
wild-type, NFZ-wt, and pds1 tissues.

Fig. 4 is a physical map of the pds1 mutation
relative to visible markers.

Figs. 5A-5C present the results of C18 HPLC
separation of lipid soluble pigments from wild-type
plants on MS2 media, homozygous pds1 mutants on MS2
media supplemented with pOHPP, and homozygous pds1
mutants on MS2 media supplemented with homogentisic
acid (HGA).

Figs. 6A-6B present the results of C8 HPLC
analyses of quinones in NFZ-wt and pds1 tissues.

Detailed Description Of The Invention

As described above, Vitamin E,
plastoquinones and carotenoids are synthesized and
accumulated in plastids by the pathways shown in
Figure 1. This specification describes the
identification, isolation, characterization and
functional analysis of a higher plant pOHPP
dioxygenase cDNA, its role in α-toc, PQ and carotenoid
synthesis, and the use of this cDNA to modify pOHPP
dioxygenase activity in plant tissues and hence the
accumulation of one or more of the compounds
plastoquinones, vitamin E, and carotenoids in plant
tissues. The overexpression of pOHPP dioxygenase in
transgenic plants will modify the enzyme-to-inhibitor
ratio of plant tissues exposed to triketone
herbicides, as compared to non-transgenic plants,
resulting in increased herbicide resistance. The
present specification also describes a genetic
construct for use in the production of pOHPP
dioxygenase, an enzyme useful in identifying new pOHPP
dioxygenase-inhibiting herbicides.

By genetic analysis the present inventors have
shown that the vitamin E, plastoquinone, and
carotenoid biosynthetic pathways are interconnected
and share common elements as shown in Figure 2. From
mutational studies in Arabidopsis thaliana, the
present inventors identified one genetic locus,
designated pds1 (pds = phytoene desaturation), the
disruption of which results in accumulation of the
first carotenoid of the carotenoid biosynthetic
pathway, phytoene. Surprisingly, though this mutation
disrupts carotenoid synthesis and was originally
identified on this basis, it does not map to the locus
encoding the phytoene desaturase enzyme. Evidence
indicates that pds1 defines a second gene product in
addition to the phytoene desaturase enzyme, necessary
for phytoene desaturation and hence carotenoid
synthesis in higher plants. This gene product proved
to be pOHPP dioxygenase.

To provide a molecular mechanism for manipulating
synthesis and accumulation of the compounds
plastoquinone, vitamin E, and carotenoids, the present
inventors used a molecular genetic approach, taking
advantage of the model plant system Arabidopsis
thaliana to define, isolate and study genes required
for synthesis of the compounds in plants. The
flowering plant Arabidopsis thaliana has come into
wide use as a model system to explore the molecular
biology and genetics of plants. Arabidopsis offers
many advantages for genetic analysis: it can be selfed
and very large numbers of progeny can be obtained (up
to 10,000 seeds from a single plant). Furthermore,
Arabidopsis has a short generation time of five to six
weeks, so crosses can be set up and the progeny
analyzed within reasonable periods of time. Mutation
screens have identified thousands of mutations
affecting many aspects of basic plant biology,
including morphogenesis, photosynthesis, fertility,
starch and lipid metabolism, mineral nutrition, and so
on. In addition, its haploid genome is only about 10^8
base pairs.

An important aspect of the successful approach
used here is that essential components were first
functionally defined genetically, prior to their
isolation, analysis and molecular manipulation.
Briefly, potential mutants were identified by a
combination of phenotypic and biochemical screening,
characterized at the genetic and molecular levels,
loci of interest selected, and the corresponding genes
then cloned and studied further. By this approach,
the inventors genetically defined and isolated cDNAs
for one gene, pds1, whose mutation disrupts synthesis
of all three classes of compounds in the plastid,
tocopherols, plastoquinones and carotenoids. Based on
biochemical analysis of the pds1 mutant, the pds1 gene
was identified as affecting the activity of pOHPP
dioxygenase, a crucial enzyme of the plastidic quinone
pathway in plants (Figure 1), that is directly
required for the synthesis of plastoquinone and
α-tocopherol and indirectly for carotenoid synthesis.
In particular, the deduced function of the pds1 mutant
and pOHPP dioxygenase enzyme are noted in Figure 2.
The present inventors demonstrated by biochemical
complementation that the pds1 mutation affects the
enzyme p-hydroxyphenyl pyruvic acid dioxygenase (pOHPP
dioxygenase), because pds1 plants can be rescued by
growth on the product but not the substrate of this
enzyme, homogentisic acid (HGA) and
p-hydroxyphenylpyruvate (pOHPP), respectively. pOHPP
dioxygenase is the key branch point enzyme and
committed step in the synthesis of both Vitamin E and
plastoquinones and several independent lines of
biochemical evidence confirm pds1 affects this enzyme
(Figures 1, 5, 6). These results provide the first
genetic evidence that plastoquinones are essential
components for carotenoid synthesis in higher plants,
most likely as an electron carrier/redox element in
the desaturation reaction (Figure 2). The Arabidopsis
pOHPP dioxygenase gene/cDNA thus provides a basis for
modifying the production of plastoquinones,
α-tocopherol and carotenoids in all higher plants.

Specifically, the specification describes the
genetic identification of the Arabidopsis pOHPP
dioxygenase gene by mutational analysis, the physical
isolation and functional confirmation of an
Arabidopsis pOHPP dioxygenase cDNA, its nucleotide
sequence and its use to isolate pOHPP dioxygenase
genes and cDNAs from other plant species. Also
included in the specification is a description of the
use of the Arabidopsis pOHPP dioxygenase cDNA, and
related cDNAs from other plants, to positively or
negatively modify the expression/activity of pOHPP
dioxygenase by recombinant techniques (overexpression,
cosuppression, antisense, etc.) in any and all plant
tissues, especially leaf and fruit tissues, to
positively or negatively affect the production of
α-toc, PQ and carotenoids.

Elevating pOHPP dioxygenase protein levels
increases the amount of homogentisic acid (HGA)
synthesized in plant tissues. Because HGA is the
limiting precursor molecule for α-toc and PQ synthesis
(the end products of the pathway), increasing HGA
synthesis increases the levels of α-toc (Vitamin E)
and PQ in plant tissues. The increase in PQ
indirectly increases the synthesis of carotenoids,
which require PQ for their synthesis. In addition,
the increase in PQ increases photosynthetic efficiency
by increasing electron flow between photosystem II and
photosystem I, because PQ is the primary electron
transporter between the two photosystems. The
increase in α-toc, a well-studied antioxidant in
mammals, increases the ability of plants to withstand
oxidative stresses, such as that caused by high light,
high temperature, water stress, ozone stress, UV
stress or other abiotic or biotic stresses. Elevating
the levels of pOHPP dioxygenase will modify the dose
response curve of herbicides targeting pOHPP
dioxygenase, thus increasing the relative resistance
to such herbicides in transgenic plants as compared to
native plants of the same species. Inhibiting the
expression of pOHPP dioxygenase is expected to have
the opposite effect.

Genetic Construct 

To express pOHPP dioxygenase in a plant, it is
required that a DNA sequence containing the pOHPP
dioxygenase coding sequence be combined with
regulatory sequences capable of expressing the coding
sequence in a plant. A number of effective plant
promoters, both constitutive and developmentally or
tissue specific, are known to those of skill in the
art. A transcriptional termination sequence
(polyadenylation sequence) may also be added. Plant
expression vectors, or plasmids constructed for
expression of inserted coding sequences in plants, are
widely used in the art to assemble chimeric plant
expression constructs including the coding sequence,
and to conveniently transfer the constructs into
plants. A sequence which codes for pOHPP dioxygenase
includes, for example, SEQ ID NO:1, or versions of the
designated sequence sufficient to effect coding for
the expression of pOHPP dioxygenase. Commonly used
methods of molecular biology well-known to those of
skill in the art may be used to manipulate the DNA
sequences.

By "genetic construct" we mean any of a variety
of ways of combining the protein-encoding sequences
with a promoter sequence (and termination sequence, if
necessary) in a manner that operably connects the
promoter sequence (and termination sequence, if
present) with the protein-encoding sequences.
Typically, the promoter sequence will be "upstream" of
a protein-encoding sequence, while the termination
sequence, if used, will be "downstream" of the
protein-encoding sequences.

The protein-encoding, promoter and termination
sequences may be combined on a plasmid or viral
vector, and inserted into a microbial host. Other
functional sequences may be added to the gene
construct. Alternatively, the protein-encoding,
promoter, and termination sequence, if added, may be
combined with any other needed functional sequences
and used without a vector.

The DNA sequence described by SEQ ID NO:1 is
sufficient to effect coding for the expression of
pOHPP dioxygenase. However, it is envisioned that the
above sequence could be truncated and still confer the
same properties. It is not known at present which
specific deletions would be successful, but it is
likely that some deletions to the protein would still
result in effective enzymatic activity. One skilled
in the art of molecular biology would be able to take
the designated sequence and perform deletional
analysis experiments to determine what portions of the
designated sequence are essential to effect coding for
the expression of pOHPP dioxygenase. One could create
a genetic construct with the candidate deletion
mutations and perform experiments as described below
in the Examples, to test whether such deletion
mutation sequences effect coding for the enzyme.
Expression of the enzyme activity indicates a
successful deletion mutant or mutants. In this
manner, one could determine which parts of the
designated sequence are essential for expression of the
enzyme.

It is also known that the genetic code is
degenerate, meaning that more than one codon, or set
of three nucleotides, codes for most amino acids. Thus
it is possible to alter the DNA coding sequence for a
protein, such as the sequence for pOHPP dioxygenase
described here, without altering the sequence of the
protein produced. Selection of codon usage may affect
expression level in a particular host. Such changes
in codon usage are also contemplated here.
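
By way of illustration only, the degeneracy of the genetic code can be
shown with a short Python sketch. The codon table is the standard genetic
code; the two 12-nucleotide sequences are hypothetical examples chosen for
this sketch and are not taken from SEQ ID NO:1.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna):
    """Translate a coding sequence (length divisible by 3) into protein."""
    dna = dna.upper()
    if len(dna) % 3:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Hypothetical sequences: three of the four codons differ, yet both
# encode the same peptide because the changed codons are synonymous.
original = "ATGGCTCTGAAA"
recoded = "ATGGCCTTAAAG"
same_protein = translate(original) == translate(recoded)
```

In this sketch `same_protein` is true, since GCT/GCC, CTG/TTA and AAA/AAG
are synonymous codon pairs. Which synonymous codons express best in a
given host is an empirical matter that the sketch does not address.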

It is further contemplated that, using the
Arabidopsis pOHPP gene coding sequence described here,
the homologous pOHPP dioxygenase sequences from
other higher plants can be readily recovered.
Oligonucleotides can be made from the sequence set
forth below either to hybridize against cDNA or
genomic libraries or to be used for PCR amplification of
homologous pOHPP dioxygenase sequences from other
plants.
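
As a rough sketch of how candidate oligonucleotides might be screened
before synthesis, the following Python fragment slides a fixed-length
window along a sequence and keeps windows with a moderate GC content.
The melting temperature uses the Wallace rule of thumb, Tm = 2(A+T) +
4(G+C), which is only an approximation for short oligonucleotides. The
window length, the GC limits and the example sequence are assumptions
made for this illustration and are not part of the specification.

```python
def wallace_tm(oligo):
    """Approximate melting temperature (deg C): 2(A+T) + 4(G+C)."""
    oligo = oligo.upper()
    at = oligo.count("A") + oligo.count("T")
    gc = oligo.count("G") + oligo.count("C")
    return 2 * at + 4 * gc


def candidate_oligos(sequence, length=18, min_gc=0.4, max_gc=0.6):
    """Yield (start, oligo, tm) for windows whose GC fraction is in range."""
    sequence = sequence.upper()
    for start in range(len(sequence) - length + 1):
        oligo = sequence[start:start + length]
        gc_fraction = (oligo.count("G") + oligo.count("C")) / length
        if min_gc <= gc_fraction <= max_gc:
            yield start, oligo, wallace_tm(oligo)


# Hypothetical stand-in for a stretch of coding sequence (not SEQ ID NO:1).
example = "ATGGCTCTGAAAGGTCCAGATCTTGCAACG"
oligos = list(candidate_oligos(example))
```

Real primer design would also consider degeneracy between species,
secondary structure and primer-dimer formation, none of which is modeled
here.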

Once a pOHPP gene is in hand, whether from
Arabidopsis or from some other plant species, it then
becomes possible to insert a chimeric plant expression
genetic construct into any plant species of interest.
Suitable plant transformation methods exist to insert
such genetic constructs into most, if not all,
commercially important plant species. Presently known
methods include Agrobacterium-mediated transformation,
coated-particle gene delivery (Biolistics) and
electroporation, in which an electric voltage is used
to facilitate gene insertion. All these methods, and
others, can insert the genetic construct into the
genome of the resulting transgenic plant in such a way
that the genetic construct becomes an inheritable
trait, transmitted to progeny of the original
transgenic plant by the normal rules of Mendelian
inheritance. Thus, once a genetic construct
expressing a pOHPP gene is inserted into a plant, it
can become a part of a plant breeding program for
transfer into any desired genetic background.

To over-express pOHPP dioxygenase, a genetic
construct may be used with a higher strength promoter.
To inhibit expression of endogenous pOHPP dioxygenase,
an antisense genetic construct can be made, as is
known by those of skill in the art, to reduce the
level of pOHPP dioxygenase present in the plant
tissues.
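
The difference between a sense and an antisense construct can be
sketched in Python as follows. The sketch simply places the coding
sequence, or its reverse complement, between a promoter and a
terminator in 5' to 3' order. The function names and the short sequences
are hypothetical placeholders introduced for this illustration; they do
not represent any actual promoter, terminator or pOHPP dioxygenase
sequence.

```python
# Translation table mapping each base to its Watson-Crick complement.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence."""
    return dna.translate(COMPLEMENT)[::-1]


def build_construct(promoter, coding, terminator, antisense=False):
    """Join promoter, insert and terminator, 5' to 3'.

    With antisense=True the coding sequence is inserted in reverse
    orientation, so the transcript is complementary to the native mRNA.
    """
    insert = reverse_complement(coding) if antisense else coding
    return promoter + insert + terminator


# Hypothetical placeholder sequences, for illustration only.
sense = build_construct("AAA", "ATGC", "TTT")
anti = build_construct("AAA", "ATGC", "TTT", antisense=True)
```

Here `sense` is "AAAATGCTTT" and `anti` is "AAAGCATTTT". The sketch
models only the orientation of the insert, not the cloning steps
required to produce it.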

EXAMPLES 

Isolation of pds1, a mutant defective in carotenoid
synthesis

To further understand carotenoid biosynthesis and
its integration with other pathways in the chloroplast
in higher plants, the present inventors studied the
pathway by isolating Arabidopsis thaliana mutants
that are blocked in carotenoid synthesis.

Plants homozygous for defects in the early stages
of carotenoid synthesis (e.g. prior to production of
β-carotene) are not viable when grown in soil, and the
isolation of such mutations requires the design of
screening procedures to identify plants heterozygous
for soil lethal mutations. The present inventors
found that most soil lethal, homozygous
pigment-deficient Arabidopsis mutants can be grown to
near maturity in tissue culture on Murashige and Skoog
basal media (Murashige and Skoog, Physiol. Plant.
15:473-497, 1962) supplemented with sucrose (MS2
media). Under these conditions, photosynthesis and
chloroplast development are essentially dispensable
and all the energy and nutritional needs of the plant
are supplied by the media.

More than 500 lines from the 10,000-member
Feldmann T-DNA tagged Arabidopsis thaliana population
(Forsthoefel et al., Aust. J. Plant Physiol.
19:353-366, 1992) were selected for pigment analysis based on
their segregation for lethal pigment mutations. Seeds
from plants heterozygous for lethal pigment mutations
were surface sterilized and grown on MS2 media, the
segregating pigment mutants were identified, tissue
was harvested from individual plants, and HPLC pigment
analysis was performed. Although numerous mutant lines
with severe pigment deficiencies were identified, only
two were found to be carotenoid biosynthetic mutants.
One mutant line isolated from this group, pds1, is
described in detail here.

The hallmark phenotype for disruption of a
biosynthetic pathway is the accumulation of an
intermediate compound prior to the site of blockage.
Such blockage of the carotenoid pathway can be
mimicked chemically by treatment of wild-type plants
with the herbicide NFZ, an inhibitor of the phytoene
desaturase enzyme (Figure 1) which has been reported
to cause accumulation of phytoene in treated tissues.
Britton, Z. Naturforsch 34c:979-985, 1979. Figs. 3A-3E
present the results of pigment analysis of wild-type,
NFZ-wt, and pds1 tissues. Abbreviations in
Figs. 3A-3E are as follows: N, neoxanthin; V,
violaxanthin; L, lutein; Cb, chlorophyll b; Ca,
chlorophyll a; β, β-carotene.

Figure 3A shows C18 Reverse Phase HPLC analysis of
the carotenoids that accumulate in wild-type
Arabidopsis thaliana leaves. In comparison, Figure 3B
shows the pigment profile for NFZ treated wild-type
(NFZ-wt). Spectral analysis of the strongly absorbing
296nm peak at 33 minutes in NFZ-wt tissue shows
absorbance maxima at 276, 286, and 298nm, indicative
of phytoene (Figure 3D). Figure 3C shows pigment
analysis of tissue culture grown homozygous pds1
mutant plants. The low absorbance at 440nm in Figures
3B and 3C demonstrates that, like NFZ-wt, pds1 mutants
lack all chlorophylls and carotenoids that normally
accumulate in wild-type tissue (compare to Figure 3A).
However, unlike wild-type, pds1 mutants contain a peak
with a retention time at approximately 33 minutes that
absorbs strongly at 296nm. The retention time and
absorbance of the 33-minute peak in the pds1 mutant
correspond to the phytoene peak in pigment extracts
of NFZ-wt tissue (Figure 3B). Spectral analysis of
the 33-minute peak from pds1 is shown in Figure 3E and
is virtually identical to the spectra of phytoene from
NFZ-wt tissue (Figure 3D) as well as to the published
spectra for phytoene. These results confirm the
chemical identity of the accumulating compound in pds1
as phytoene and conclusively demonstrate that the pds1
mutation disrupts carotenoid biosynthesis.

Carotenoid Analysis 

For quantitative and qualitative carotenoid
analysis, plant tissue is placed in a microfuge tube
and ground with a micropestle in 200µl of 80% acetone.
120µl of ethyl acetate is added and the mixture
vortexed. 140µl of water is added and the mixture
centrifuged for 5 minutes. The carotenoid containing
upper phase is then transferred to a fresh tube and
vacuum dried in a Jouan RC1010 Centrifugal Evaporator.
The dried extract is resuspended in ethyl acetate at a
concentration of 0.5mg fresh weight of tissue per µl
and either analyzed immediately by HPLC or stored at
-80°C under nitrogen.
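
The resuspension step amounts to a simple proportion, sketched here in
Python. The function name and the 25 mg example are assumptions made
for illustration; only the target of 0.5 mg fresh weight per µl comes
from the procedure above.

```python
def resuspension_volume_ul(fresh_weight_mg, mg_per_ul=0.5):
    """Volume of ethyl acetate (in microliters) that gives the target
    concentration, expressed as mg fresh weight of tissue per microliter."""
    if fresh_weight_mg <= 0 or mg_per_ul <= 0:
        raise ValueError("weight and concentration must be positive")
    return fresh_weight_mg / mg_per_ul


# Hypothetical sample: 25 mg of tissue was extracted.
volume = resuspension_volume_ul(25)
```

For the hypothetical 25 mg sample, `volume` is 50.0, that is, 50 µl of
ethyl acetate.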

Carotenoids were separated by reverse-phase HPLC
analysis on a Spherisorb ODS2 5 micron C18 column, 25
cm in length (Phase Separations Limited, Norwalk, CT)
using a 45 minute gradient of Ethyl Acetate (0-100%)
in Acetonitrile/water/triethylamine (9:1:0.01 v/v), at
a flow rate of 1 ml per minute (Goodwin and Britton,
1988). Carotenoids were identified by retention time
relative to known standards with detection at both
296nm and 440nm. When needed, absorption spectra for
individual peaks were obtained with a Hewlett Packard
1040A photodiode array detector and compared with
published spectra or available standards.
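
For orientation, the nominal solvent composition at a given time can be
estimated with the Python sketch below. It assumes that the 45 minute
gradient is linear from 0% to 100% ethyl acetate, which the text does
not state explicitly, and it ignores instrument dwell volume; the
function name is hypothetical.

```python
def ethyl_acetate_percent(t_min, duration_min=45.0):
    """Nominal percent ethyl acetate at time t_min, assuming a linear
    0-100% ramp over duration_min minutes (an assumption of this sketch)."""
    if t_min <= 0:
        return 0.0
    if t_min >= duration_min:
        return 100.0
    return 100.0 * t_min / duration_min


# Nominal composition at the 33-minute retention time discussed above.
at_33_min = ethyl_acetate_percent(33)
```

Under the linearity assumption, a compound eluting at 33 minutes leaves
the column at roughly 73% ethyl acetate.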

Quinone analysis 

Quinones were extracted from tissue using a
method modified from that described in Bligh et al.,
Can. J. Biochem. Physiol. 37:911-917, 1959. Frozen
plant tissue was ground in a mortar with 3 volumes of
chloroform and 6 volumes of methanol and transferred
to a test tube. Water and additional chloroform were
added until a biphasic mixture was obtained. The
quinone-containing chloroform phase was then
collected. To increase yields, the aqueous phase was
back-extracted with chloroform, the two chloroform
phases pooled, and then filtered through Whatman #3
filter paper. The resulting filtrate was dried under
a constant stream of nitrogen. Once dried, the pellet
was resuspended in methanol at a concentration of 10 mg
fresh weight per ml and immediately analyzed by HPLC.
Quinones were resolved by reversed-phase HPLC analysis
on a LiChrosorb RP-8, 5 micron column, 25 cm in length
(Alltech, San Jose, CA), using an isocratic solvent of
10% H2O in methanol for the first 14 minutes, at which
time the solvent was switched to 100% methanol for the
remainder of the run (modified from the method
described in Lichtenthaler, Handbook of
Chromatography, CRC Press, 115-159, 1984). The flow
rate was 1 ml per minute for the duration. Peaks were
identified based upon the retention time of known
standards, with detection at 280 nm for α-tocopherol and
260 nm for plastoquinone and ubiquinone, as well as by
absorption spectra from a Hewlett Packard 1040A
photodiode array detector. When needed, fractions
represented by individual chromatographic peaks were
collected and submitted to the Southwest
Environmental Health Science Center, Analytical Core
laboratory for mass spectral analysis. Results were
obtained using a TSQ7000 tandem mass spectrometer
(Finnigan Corp., San Jose, CA) equipped with an
atmospheric pressure chemical ionization source
operated in the positive ion mode. The instrument was
set to unit resolution and the samples were introduced
into the source in a 0.3 ml/minute methanol stream and
ionized using a 5 kV discharge.
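
The two-step solvent program used for the quinone separation can be written as a small lookup, which may help when annotating peaks by retention time. The following is an illustrative sketch only: the function names and the tuple layout are assumptions introduced here, and the behavior at exactly 14 minutes is a choice made for the example, since the text only states that the switch occurs at that time.

```python
FLOW_RATE_ML_PER_MIN = 1.0  # flow rate stated for the whole run


def quinone_mobile_phase(minutes):
    """Return (percent_water, percent_methanol) at a given run time.

    Encodes the stated program: 10% H2O in methanol for the first
    14 minutes, then 100% methanol for the remainder of the run.
    Treating t = 14 min as already switched is an assumption.
    """
    if minutes < 0:
        raise ValueError("run time cannot be negative")
    if minutes < 14:
        return (10, 90)
    return (0, 100)


def solvent_volume_ml(minutes):
    """Total solvent delivered up to a given run time at 1 ml/min."""
    return FLOW_RATE_ML_PER_MIN * minutes


if __name__ == "__main__":
    # 13.5 and 27 minutes are retention times reported in this
    # description for alpha-tocopherol and plastoquinone.
    for t in (13.5, 27):
        print(t, quinone_mobile_phase(t), solvent_volume_ml(t))
```

Under this program a peak eluting at 13.5 minutes leaves the column in the aqueous methanol phase, while peaks at 26 and 27 minutes elute in 100% methanol.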



Genetic analysis of pds1

The genetic nature of the pds1 mutation was
determined by analyzing seeds resulting from selfing
pds1 heterozygous plants. Prior to desiccation, F1
seeds were scored as either green (wild-type or
heterozygous) or white (homozygous). A 3:1
segregation ratio was observed (146 green seeds: 48
white seeds), indicating that pds1 is inherited as
a single recessive nuclear mutation (χ²=0.01, p > 0.90).
Because pds1 mutants are inhibited in the desaturation
of phytoene, the inventors believed that it might be a
mutation in the phytoene desaturase enzyme, which had
previously been mapped to chromosome 4, between ag and
bp. Wetzel et al., Plant J. 6:161-175, 1994. To test
this hypothesis, the pds1 mutation was mapped relative
to visible markers. The pds1 mutation was found to
map to chromosome 1, approximately 7 cM from dis1
toward clv2. Franzmann et al., Plant J. 7:341-350,
1995. These data are summarized in Figure 4
and establish that pds1 does not map to the phytoene
desaturase enzyme locus, thus proving that the pds1
mutation is not in the phytoene desaturase enzyme.
These data provided important insight for
characterization of the pds1 mutant.
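
The reported segregation statistic can be checked with a chi-square goodness-of-fit test against the 3:1 expectation for a single recessive nuclear mutation. The sketch below uses only the seed counts given above (146 green, 48 white); the function names are illustrative, and the p-value formula is the closed form for one degree of freedom.

```python
import math


def chi_square_gof(observed, expected_ratio):
    """Chi-square goodness-of-fit statistic for counts vs. a ratio."""
    total = sum(observed)
    ratio_sum = sum(expected_ratio)
    expected = [total * r / ratio_sum for r in expected_ratio]
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    return chi2, expected


def p_value_df1(chi2):
    """Upper-tail p-value of a chi-square statistic with 1 degree of
    freedom, which equals erfc(sqrt(chi2 / 2))."""
    return math.erfc(math.sqrt(chi2 / 2.0))


# Seed counts from the text: green (wild-type or heterozygous) vs. white.
chi2, expected = chi_square_gof([146, 48], [3, 1])
p = p_value_df1(chi2)

if __name__ == "__main__":
    print("expected counts:", expected)
    print("chi-square:", round(chi2, 2))
    print("p > 0.90:", p > 0.90)
```

With 194 seeds the expected counts are 145.5 and 48.5, so the observed counts deviate by only half a seed in each class, which is why the statistic rounds to 0.01 and the 3:1 hypothesis is not rejected.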






Homozygous pds1 mutants can be rescued by Homogentisic
Acid, an intermediate in plastoquinone and tocopherol
biosynthesis

As described earlier, previous research
suggesting a role for quinones and pOHPP dioxygenase
in phytoene desaturation led the present inventors to
investigate the quinone biosynthetic pathway in the
pds1 mutant. The early stages of
plastoquinone/tocopherol synthesis were functionally
analyzed by growth in the presence of two intermediate
compounds in the pathway, p-hydroxyphenylpyruvate
(pOHPP) and homogentisic acid (HGA) (refer to Figures
1 and 2). Albino pds1 homozygous plants were first
germinated on MS2 media and then transferred to MS2
media supplemented with 100 µM of either pOHPP or HGA.
pds1 plants remained albino when transferred to media
containing pOHPP, but greening occurred when pds1
plants were transferred to media containing HGA.
Figs. 5A-5C present the results of complementation of
the pds1 mutation with homogentisic acid. Each
profile represents pigments extracted from 10 mg fresh
weight of tissue. Abbreviations used in Figs. 5A-5C
are as described in Figs. 3A-3E. HPLC analyses with
detection at 440 nm of the carotenoids extracted from
pds1 plants grown on pOHPP and HGA are shown in
Figures 5B and 5C, respectively. The pigment profiles
of pds1 mutants grown on pOHPP are similar to the
profiles of pds1 plants grown on MS2 media shown in
Figure 3B. Comparison of the pigment profiles for
pds1 + HGA tissue and wild-type tissue (Figures 5A and
5C) indicates that growth in the presence of HGA is
able to qualitatively restore a wild-type carotenoid
profile to albino, homozygous pds1 plants. These
results indicate that the pds1 mutation affects the
enzyme pOHPP dioxygenase, because pds1 mutants are not
altered by growth on the substrate of this enzyme,
pOHPP, but rather are restored qualitatively to
wild-type pigmentation by growth on the product of
this enzyme, HGA (refer to Figures 1 and 2). The
complementation of pds1 with HGA also indicates that
intermediates or end products of this pathway
(plastoquinone and/or tocopherols, refer to Figures 1
and 2) are necessary components for phytoene
desaturation in plants and confirms the observation of
Schulz et al. in FEBS where inhibitors of pOHPP
dioxygenase were shown to cause accumulation of
phytoene.

HPLC analysis conclusively demonstrates that pds1 is a
mutation in the plastoquinone/tocopherol biosynthetic
pathway that also affects carotenoid synthesis

In addition to biochemical complementation of
pds1 mutants, the plastoquinone/tocopherol pathway was
also directly analyzed in pds1 tissue by utilizing C8
HPLC to resolve total lipid extracts and identify
three separate classes of quinones: ubiquinone,
plastoquinone, and α-tocopherol (Vitamin E) (Figures 5
and 6). Ubiquinone and plastoquinone perform
analogous electron transport functions in the
mitochondria and chloroplast, respectively, but are
synthesized by different pathways in separate
subcellular compartments (Goodwin et al., Introduction
to Plant Biochemistry, Oxford, Pergamon Press, 1983),
making ubiquinone an ideal internal control in these
analyses. Figure 6 shows the C8 HPLC analysis of lipid
soluble extracts from NFZ-Wt tissue and pds1 tissue.
In NFZ-Wt tissue (Figure 6A), peaks 3 and 4 were
identified as ubiquinone and plastoquinone,
respectively, based on retention time (26 and 27
minutes), optical spectra, and mass spectra (results
not shown). NFZ-Wt tissue contained a peak (1) with a
retention time of 13.5 minutes which was identified as
α-tocopherol based upon the retention time of a
standard. However, optical spectroscopy and mass
spectrometry demonstrated that peak 1 was composed of
two major components: α-tocopherol (1a) and an
unidentified compound (1b). The mass of α-tocopherol
was determined to be 430, as indicated by the presence
of the 431 protonated molecule, while the molecular
mass of the unidentified compound was 412, as
indicated by the presence of the 413 protonated
molecule (data not shown), clearly demonstrating the
presence of two compounds in peak 1. This quinone
analysis demonstrates that the herbicide NFZ, which
specifically inhibits the phytoene desaturase enzyme,
does not affect synthesis of homogentisate-derived
quinones. pds1 tissue (Figure 6B) contains ubiquinone
(peak 3) but lacks plastoquinone (peak 4).
Additionally, though pds1 contains a peak at 13.5
minutes, optical spectroscopy and mass spectrometry
data demonstrate that this peak lacks α-tocopherol
(1a) and is composed solely of the compound 1b (data
not shown). Therefore, homozygous pds1 plants
accumulate ubiquinone but lack both plastoquinone and
α-tocopherol. This is consistent with the pds1
mutation affecting pOHPP dioxygenase (refer to Figures
1 and 2), as suggested by the rescue of the mutation
by HGA, and provides additional evidence that the pds1
mutation disrupts pOHPP dioxygenase.
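
The masses quoted for the two components of peak 1 follow from the protonated molecules observed in positive ion mode: for a singly charged [M+H]+ ion, the nominal molecular mass is the observed m/z minus the nominal mass of one proton. A minimal sketch of that arithmetic, with function and variable names chosen here for illustration and a singly charged ion assumed by default:

```python
PROTON_NOMINAL_MASS = 1  # nominal (integer) mass of the added proton


def neutral_mass_from_protonated(mz, charge=1):
    """Nominal molecular mass M from an [M + zH]z+ ion seen at m/z.

    Assumes the ion is a simple protonated molecule; adducts or
    fragments would need different bookkeeping.
    """
    if charge < 1:
        raise ValueError("charge must be a positive integer")
    return mz * charge - charge * PROTON_NOMINAL_MASS


# Protonated molecules reported for the two components of peak 1.
observed = {"1a (alpha-tocopherol)": 431, "1b (unidentified)": 413}
masses = {name: neutral_mass_from_protonated(mz)
          for name, mz in observed.items()}

if __name__ == "__main__":
    for name, mass in masses.items():
        print(name, mass)
```

This reproduces the molecular masses of 430 and 412 stated in the text for components 1a and 1b.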

Isolation of a truncated, putative pOHPP dioxygenase
Arabidopsis cDNA

The observation of Schulz et al. demonstrating
that inhibitors of pOHPP dioxygenase activity disrupt
carotenoid synthesis and cause accumulation of
phytoene provided important insight for the
characterization of the pds1 mutant, which in turn
provided the present inventors with important insight
for the isolation of a putative cDNA for the pds1
locus. In animals, genetic defects which inhibit the
activity of pOHPP dioxygenase lead to tyrosinemia type
I, a fatal inherited disease in aromatic amino acid
catabolism characterized by the presence of high
levels of pOHPP in the urine.
In an effort to further understand the nature of
this disease, pOHPP dioxygenase cDNAs have been cloned
from several mammalian and bacterial sources
(summarized in Ruetschi et al., Eur. J. Biochem.
205:459-466, 1992). Amino acid identity between
various mammalian pOHPP dioxygenase enzymes is >80%;
in comparison, their identity to bacterial homologs is
very low, less than 28%. By using mammalian and
bacterial sequences to search the Expressed Sequence
Tags (ESTs) computer DNA database (Newman et al.,
Plant Physiol. 106:1241-1255, 1994), one partial
length Arabidopsis EST was identified and used as a
probe. The partial length Arabidopsis probe
corresponds to base pairs 1072 through 1500 of SEQ ID
NO:1.

This cDNA contained only 99 amino acids of the
carboxyl terminal portion of the protein coding
region. The deduced protein sequence of this putative
Arabidopsis pOHPP dioxygenase cDNA shows similar
homology (~50% identity) to both the mammalian and
bacterial pOHPP dioxygenases. Interestingly, the
partial Arabidopsis sequence also contains a 15 amino
acid insertion not found in the human or bacterial
enzymes. Finally, alignment of six pOHPP dioxygenase
sequences from mammals and bacteria identified three
regions of high conservation, the highest being a 16
amino acid region near the carboxy end of pOHPP
dioxygenases that shows 62.5% identity across all
phyla. Ruetschi et al., Eur. J. Biochem. 205:459-466,
1992. This region is also present in the truncated
Arabidopsis sequence. These lines of evidence suggest
that the partial length Arabidopsis cDNA described
above encodes a pOHPP dioxygenase, most likely the
product of the pds1 locus.

Isolation and Characterization of a full length
Arabidopsis pOHPP dioxygenase cDNA

Utilizing the partial length Arabidopsis cDNA
probe, an Arabidopsis cDNA library was screened by
nucleic acid hybridization for full length cDNAs. A
large number of hybridizing cDNAs were isolated, and
one of the longest, pHPP1.5, containing a 1,520 bp
insert, was sequenced completely; the insert is
presented as SEQ ID NO:1. pHPP1.5 encodes a 446
amino acid protein (presented as SEQ ID NO:2), which
is slightly larger in size than mammalian and
bacterial pOHPP dioxygenases. pHPP1.5 shows 34-40%
identity at the amino acid level to pOHPP dioxygenases
from various mammals and bacteria. In comparing four
bacterial pOHPP dioxygenases and one mammalian pOHPP
dioxygenase (pig), which ranged in size from 346-404
amino acids, Denoya et al. identified 69 amino acids
that were conserved between all five pOHPP
dioxygenases. Denoya et al., J. Bacteriol. 176:5312-5319,
1994. The pHPP1.5 coding region contains 52 of
these 69 conserved amino acids.
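
As an illustration of how the deduced protein sequence relates to the cDNA, the sketch below translates the first 54 nucleotides of SEQ ID NO:1 as given in the sequence listing, and computes the fraction of the Denoya et al. conserved residues found in pHPP1.5. It is a simplified example, not the procedure used by the inventors: taking the first ATG as the start codon is an assumption that happens to agree with the listing, and all function and variable names are introduced here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): a
               for c, a in zip(product(BASES, repeat=3), AMINO)}

# Nucleotides 1-54 of SEQ ID NO:1, copied from the sequence listing.
SEQ1_FIRST_54 = (
    "CCACGCGTCC" "GAGTTTTAGC" "AGAGTTGGTG" "AAATCA"
    "ATGGGCCACCAAAACGCC"
)


def translate_from_first_atg(dna):
    """Return (1-based start position, one-letter protein string).

    Translation starts at the first ATG (an assumption) and stops at
    the first stop codon or at the end of the sequence.
    """
    start = dna.find("ATG")
    if start < 0:
        return None, ""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return start + 1, "".join(protein)


start_pos, peptide = translate_from_first_atg(SEQ1_FIRST_54)

# 52 of the 69 residues conserved in the five enzymes compared by
# Denoya et al. are present in pHPP1.5.
conserved_percent = round(100 * 52 / 69, 1)

if __name__ == "__main__":
    print(start_pos, peptide, conserved_percent)
```

The translated fragment, MGHQNA, corresponds to the first six residues of SEQ ID NO:2 (Met Gly His Gln Asn Ala), with the start codon at nucleotide 37, and the conserved-residue count works out to roughly three quarters of the 69 positions.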

Demonstration that pHPP1.5 encodes an active pOHPP
dioxygenase protein and complements the pds1 mutation

In order to definitively demonstrate that pHPP1.5
is the gene product encoded by the pds1 locus and that
it encodes a functional pOHPP dioxygenase protein, the
pHPP1.5 cDNA was cloned into a plant transformation
vector for molecular complementation experiments with
the pds1 mutant. The full length wild-type pOHPP dioxygenase
cDNA will be subcloned into a plant transformation
vector driven by the Cauliflower Mosaic Virus 35S
(CaMV) promoter and containing all necessary
termination cassettes and selectable markers (KanR).
The CaMV promoter is a strong constitutive promoter.
This single construct and the vector without the
pHPP1.5 insert (as a control) will be used in vacuum
infiltration transformation, which uses whole soil
grown plants and will be done on plants that are
heterozygous for the pds1 mutation. Bouchez et al., CR
Acad. Sci. Paris, Sciences de la vie 316, 1993.

In the standard procedure, 20-30 soil grown
plants will be independently transformed and analyzed
separately. In this case homozygous plants containing
the pds1 mutation would be lethal while heterozygous
plants containing the pds1 mutation would be
segregating 2:1 for the pds1 mutation in their
siliques. The inventors will use a similar number of
wild type plants in a parallel transformation as a
control. After transformation of the pds1 segregating
plant population, as the plants are setting seed the
inventors can easily identify those heterozygous for
the pds1 mutation in retrospect by inspection of their
siliques, which would contain green:white embryos in a
3:1 ratio.

Seed harvested from individually transformed
heterozygous pds1 plants will be germinated on
kanamycin and resistant seedlings transferred to soil.
Segregation analysis of seed from these primary
transformants (T2 seed) and T3 seed for segregation of
the pds1 phenotype (albino and phytoene accumulating)
and the T-DNA encoded kanamycin resistance marker
(wild type pOHPP dioxygenase cDNA) will conclusively
demonstrate complementation of the pds1 mutation with
the pOHPP dioxygenase cDNA. To provide additional
proof that pHPP1.5 is encoded by the pds1 locus, the
pHPP1.5 cDNA has been mapped relative to the pds1
locus using recombinant inbred lines, as described in
Lister et al., Plant J. 4:745-750, 1993. The pHPP1.5
cDNA mapped to the region of chromosome 1 containing
the pds1 mutation (Figure 4). Finally, the pHPP1.5
cDNA will be overexpressed in E. coli and the activity
of the protein determined.

Modification of pOHPP Expression

From the genetic and biochemical studies
described above it is clear that only one pOHPP
dioxygenase gene product is involved in chloroplastic
quinone synthesis, that the pds1 mutation defines this
gene, that the pHPP1.5 cDNA is the product encoded by
the pds1 locus and that disruption of its function
completely eliminates Vitamin E production and
plastoquinone and carotenoid synthesis in plant
tissues. Modification of pOHPP dioxygenase expression
in plants by molecular techniques using pHPP1.5 can
therefore be used to positively or negatively affect
the production of tocopherols and plastoquinones directly
and carotenoids indirectly (refer to Figures 1 and 2).
Specifically, overexpression of the pOHPP dioxygenase
enzyme will result in increased levels of one or more
of these compounds in the tissues of transgenic
plants. Alternatively, using antisense techniques, it
is possible to lower the level of enzyme activity to
decrease the levels of these compounds in plants.
Additionally, overexpression of the pOHPP dioxygenase
will enable a transgenic plant to withstand elevated
levels of herbicides that target this enzyme,
providing agronomically significant herbicide
resistance relative to normal plants.

Two different plant systems, Arabidopsis and
tomato, are being used to demonstrate the effects of
modified pOHPP dioxygenase in plant tissues.
Constitutive overexpression of pOHPP dioxygenase will
be done in both plant systems utilizing the CaMV 35S
promoter and the pHPP1.5 cDNA. The consequences of
this altered expression on tocopherol, plastoquinone
and carotenoid levels and profiles in various plant
tissues will be determined as described below. In
tomato, tissue specific overexpression of pOHPP
dioxygenase (pHPP1.5) will be driven by the fruit
specific promoter derived from the tomato β subunit
gene, which is expressed specifically in developing,
but not ripening tomato fruit. This will determine
the potential for modifying the levels of tocopherol,
plastoquinone and carotenoids specifically in
developing and ripening fruit for nutritional purposes
without affecting their production in other plant
tissues. These combined experiments will determine
whether pOHPP dioxygenase is a rate limiting step in
chloroplastic homogentisic acid derived quinone
synthesis and the potential for manipulating
chloroplastic homogentisic acid derived quinones
(tocopherols and plastoquinones) and compounds that
require quinones for their synthesis (carotenoids,
etc.) by increasing pOHPP dioxygenase activity.

Multiple independent transformants will be
produced for each construct and plant species used.
The integration and gene copy number of each chimeric
gene in each line will be confirmed by Southern
analysis, the level of pOHPP mRNA determined by
Northern blot analysis, pOHPP dioxygenase activity
determined as described in Schulz et al., FEBS
318:162-166, 1993, and the effects on individual
chloroplastic components of interest analyzed
(tocopherols, plastoquinones and carotenoids). In
green tissue containing constitutively expressing
constructs this analysis can occur relatively soon
after transformants are put into soil. Analysis of
fruit specific construct lines will require much more
time for fruit set to occur. Analysis of tocopherols,
plastoquinones and carotenoids will be by a
combination of HPLC, optical and mass spectra as
described in Norris et al. (1995, in press). Analysis
of tocopherol levels is performed by HPLC and when
needed by GC:mass spectroscopy in selected ion mode.
In MS analysis the absolute level of tocopherol will
be quantified by isotopic dilution with a known,
"heavy carbon" tocopherol standard added at the start
of the extraction. Determination based on fresh
weight of tissue can also be performed. Plastoquinone
levels will be quantified by C8 HPLC and optical
spectra as described in Norris et al. (1995, in
press). Total carotenoid levels are determined
spectrophotometrically and the levels of individual
carotenes quantified by C18 HPLC and optical spectra
quantified to standards. In the course of these
experiments we will identify high expressing lines
with simple insertions that segregate as single
genetic loci in progeny. This will facilitate
analysis of the inheritance of the gene and phenotype
in future generations.
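
The isotopic dilution quantification mentioned above rests on a simple proportion: because the labeled standard is added in a known amount at the start of the extraction, the analyte amount can be estimated from the ratio of the two ion signals. The sketch below assumes equal recovery and equal detector response for the labeled and unlabeled tocopherol, which is a common simplification rather than something stated in this description. The function names and all numerical inputs are hypothetical and are not data from this work.

```python
def isotope_dilution_amount(analyte_signal, standard_signal,
                            standard_amount_ng):
    """Estimate the analyte amount (ng) from the signal ratio.

    Assumes the labeled standard and the analyte are recovered and
    detected with equal efficiency (a simplifying assumption).
    """
    if standard_signal <= 0:
        raise ValueError("standard signal must be positive")
    return standard_amount_ng * analyte_signal / standard_signal


def per_fresh_weight(amount_ng, fresh_weight_mg):
    """Express an amount relative to tissue fresh weight (ng per mg)."""
    if fresh_weight_mg <= 0:
        raise ValueError("fresh weight must be positive")
    return amount_ng / fresh_weight_mg


# Hypothetical values for illustration only.
amount_ng = isotope_dilution_amount(
    analyte_signal=2.0e5,      # selected-ion signal, unlabeled tocopherol
    standard_signal=1.0e5,     # selected-ion signal, labeled standard
    standard_amount_ng=50.0,   # amount of standard added at extraction
)
level_ng_per_mg = per_fresh_weight(amount_ng, fresh_weight_mg=10.0)

if __name__ == "__main__":
    print(amount_ng, level_ng_per_mg)
```

Adding the standard before extraction means that losses during sample work-up affect analyte and standard alike, so the ratio, and therefore the estimate, is largely unaffected by them.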

Overexpression of pOHPP for in vitro Herbicide
Analysis

pOHPP dioxygenase will be overexpressed in E.
coli or other prokaryotic or eukaryotic protein
production systems and purified in large amounts for
use in enzymatic assays for identifying new herbicide
compounds (pOHPP inhibitors) and optimizing existing
chemistries through detailed kinetic analysis.

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

CCACGCGTCC GAGTTTTAGC AGAGTTGGTG AAATCA ATG GGC CAC CAA AAC GCC  54
                                        Met Gly His Gln Asn Ala
                                        1               5

GCC GTT TCA GAG AAT CAA AAC CAT GAT GAC GGC GCT GCG TCG TCG CCG  102
Ala Val Ser Glu Asn Gln Asn His Asp Asp Gly Ala Ala Ser Ser Pro
            10                  15                  20

GGA TTC AAG CTC GTC GGA TTT TCC AAG TTC GTA AGA AAG AAT CCA AAG  150
Gly Phe Lys Leu Val Gly Phe Ser Lys Phe Val Arg Lys Asn Pro Lys
        25                  30                  35

TCT GAT AAA TTC AAG GTT AAG CGC TTC CAT CAC ATC GAG TTC TGG TGC  198
Ser Asp Lys Phe Lys Val Lys Arg Phe His His Ile Glu Phe Trp Cys
    40                  45                  50

GGC GAC GCA ACC AAC GTC GCT CGT CGC TTC TCC TGG GGT CTG GGG ATG  246
Gly Asp Ala Thr Asn Val Ala Arg Arg Phe Ser Trp Gly Leu Gly Met
55                  60                  65                  70

AGA TTC TCC GCC AAA TCC GAT CTT TCC ACC GGA AAC ATG GTT CAC GCC  294
Arg Phe Ser Ala Lys Ser Asp Leu Ser Thr Gly Asn Met Val His Ala
                75                  80                  85

TCT TAC CTA CTC ACC TCC GGT GAC CTC CGA TTC CTT TTC ACT GCT CCT  342
Ser Tyr Leu Leu Thr Ser Gly Asp Leu Arg Phe Leu Phe Thr Ala Pro
            90                  95                  100

TAC TCT CCG TCT CTC TCC GCC GGA GAG ATT AAA CCG ACA ACC ACA GCT  390
Tyr Ser Pro Ser Leu Ser Ala Gly Glu Ile Lys Pro Thr Thr Thr Ala
        105                 110                 115

TCT ATC CCA AGT TTC GAT CAC GGC TCT TGT CGT TCC TTC TTC TCT TCA  438
Ser Ile Pro Ser Phe Asp His Gly Ser Cys Arg Ser Phe Phe Ser Ser
    120                 125                 130

CAT GGT CTC GGT GTT AGA GCC GTT GCG ATT GAA GTA GAA GAC GCA GAG  486
His Gly Leu Gly Val Arg Ala Val Ala Ile Glu Val Glu Asp Ala Glu
135                 140                 145                 150

TCA GCT TTC TCC ATC AGT GTA GCT AAT GGC GCT ATT CCT TCG TCG CCT  534
Ser Ala Phe Ser Ile Ser Val Ala Asn Gly Ala Ile Pro Ser Ser Pro
                155                 160                 165

CCT ATC GTC CTC AAT GAA GCA GTT ACG ATC GCT GAG GTT AAA CTA TAC  582
Pro Ile Val Leu Asn Glu Ala Val Thr Ile Ala Glu Val Lys Leu Tyr
            170                 175                 180

GGC GAT GTT GTT CTC CGA TAT GTT AGT TAC AAA GCA GAA GAT ACC GAA  630
Gly Asp Val Val Leu Arg Tyr Val Ser Tyr Lys Ala Glu Asp Thr Glu
        185                 190                 195

AAA TCC GAA TTC TTG CCA GGG TTC GAG CGT GTA GAG GAT GCG TCG TCG  678
Lys Ser Glu Phe Leu Pro Gly Phe Glu Arg Val Glu Asp Ala Ser Ser
    200                 205                 210

TTC CCA TTG GAT TAT GGT ATC CGG CGG CTT GAC CAC GCC GTG GGA AAC  726
Phe Pro Leu Asp Tyr Gly Ile Arg Arg Leu Asp His Ala Val Gly Asn
215                 220                 225                 230

GTT CCT GAG CTT GGT CCG GCT TTA ACT TAT GTA GCG GGG TTC ACT GGT  774
Val Pro Glu Leu Gly Pro Ala Leu Thr Tyr Val Ala Gly Phe Thr Gly
                235                 240                 245

TTT CAC CAA TTC GCA GAG TTC ACA GCA GAC GAC GTT GGA ACC GCC GAG  822
Phe His Gln Phe Ala Glu Phe Thr Ala Asp Asp Val Gly Thr Ala Glu
            250                 255                 260

AGC GGT TTA AAT TCA GCG GTC CTG GCT AGC AAT GAT GAA ATG GTT CTT  870
Ser Gly Leu Asn Ser Ala Val Leu Ala Ser Asn Asp Glu Met Val Leu
        265                 270                 275

CTA CCG ATT AAC GAG CCA GTG CAC GGA ACA AAG AGG AAG AGT CAG ATT  918
Leu Pro Ile Asn Glu Pro Val His Gly Thr Lys Arg Lys Ser Gln Ile
    280                 285                 290

CAG ACG TAT TTG GAA CAT AAC GAA GGC GCA GGG CTA CAA CAT CTG GCT  966
Gln Thr Tyr Leu Glu His Asn Glu Gly Ala Gly Leu Gln His Leu Ala
295                 300                 305                 310

CTG ATG AGT GAA GAC ATA TTC AGG ACC CTG AGA GAG ATG AGG AAG AGG  1014
Leu Met Ser Glu Asp Ile Phe Arg Thr Leu Arg Glu Met Arg Lys Arg
                315                 320                 325

AGC AGT ATT GGA GGA TTC GAC TTC ATG CCT TCT CCT CCG CCT ACT TAC  1062
Ser Ser Ile Gly Gly Phe Asp Phe Met Pro Ser Pro Pro Pro Thr Tyr
            330                 335                 340

TAC CAG AAT CTC AAG AAA CGG GTC GGC GAC GTG CTC AGC GAT GAT CAG  1110
Tyr Gln Asn Leu Lys Lys Arg Val Gly Asp Val Leu Ser Asp Asp Gln
        345                 350                 355

ATC AAG GAG TGT GAG GAA TTA GGG ATT CTT GTA GAC AGA GAT GAT CAA  1158
Ile Lys Glu Cys Glu Glu Leu Gly Ile Leu Val Asp Arg Asp Asp Gln
    360                 365                 370

GGG ACG TTG CTT CAA ATC TTC ACA AAA CCA CTA GGT GAC AGG CCG ACG  1206
Gly Thr Leu Leu Gln Ile Phe Thr Lys Pro Leu Gly Asp Arg Pro Thr
375                 380                 385                 390

ATA TTT ATA GAG ATA ATC CAG AGA GTA GGA TGC ATG ATG AAA GAT GAG  1254
Ile Phe Ile Glu Ile Ile Gln Arg Val Gly Cys Met Met Lys Asp Glu
                395                 400                 405

GAA GGG AAG GCT TAC CAG AGT GGA GGA TGT GGT GGT TTT GGC AAA GGC  1302
Glu Gly Lys Ala Tyr Gln Ser Gly Gly Cys Gly Gly Phe Gly Lys Gly
            410                 415                 420

AAT TTC TCT GAG CTC TTC AAG TCC ATT GAA GAA TAC GAA AAG ACT CTT  1350
Asn Phe Ser Glu Leu Phe Lys Ser Ile Glu Glu Tyr Glu Lys Thr Leu
        425                 430                 435

GAA GCC AAA CAG TTA GTG GGA TGA ACT^GAAGAA GAACCAACTA AAGGATTGTG  1404
Glu Ala Lys Gln Leu Val Gly *
    440                 445

TAATTAATGT AAAACTGTTT TATCTTATCA AAACAATGTT ATACAACATC TCATTTAAAA  1464

ACGAGATCAA TCAAAAAATA CAATCTTAAA TTCAAAACCA AAAAAAAAAA AAAAA  1519
(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 446 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

Met Gly His Gln Asn Ala Ala Val Ser Glu Asn Gln Asn His Asp Asp
1               5                   10                  15

Gly Ala Ala Ser Ser Pro Gly Phe Lys Leu Val Gly Phe Ser Lys Phe
            20                  25                  30

Val Arg Lys Asn Pro Lys Ser Asp Lys Phe Lys Val Lys Arg Phe His
        35                  40                  45

His Ile Glu Phe Trp Cys Gly Asp Ala Thr Asn Val Ala Arg Arg Phe
    50                  55                  60

Ser Trp Gly Leu Gly Met Arg Phe Ser Ala Lys Ser Asp Leu Ser Thr
65                  70                  75                  80

Gly Asn Met Val His Ala Ser Tyr Leu Leu Thr Ser Gly Asp Leu Arg
                85                  90                  95

Phe Leu Phe Thr Ala Pro Tyr Ser Pro Ser Leu Ser Ala Gly Glu Ile
            100                 105                 110

Lys Pro Thr Thr Thr Ala Ser Ile Pro Ser Phe Asp His Gly Ser Cys
        115                 120                 125

Arg Ser Phe Phe Ser Ser His Gly Leu Gly Val Arg Ala Val Ala Ile
    130                 135                 140

Glu Val Glu Asp Ala Glu Ser Ala Phe Ser Ile Ser Val Ala Asn Gly
145                 150                 155                 160

Ala Ile Pro Ser Ser Pro Pro Ile Val Leu Asn Glu Ala Val Thr Ile
                165                 170                 175

Ala Glu Val Lys Leu Tyr Gly Asp Val Val Leu Arg Tyr Val Ser Tyr
            180                 185                 190

Lys Ala Glu Asp Thr Glu Lys Ser Glu Phe Leu Pro Gly Phe Glu Arg
        195                 200                 205

Val Glu Asp Ala Ser Ser Phe Pro Leu Asp Tyr Gly Ile Arg Arg Leu
    210                 215                 220

Asp His Ala Val Gly Asn Val Pro Glu Leu Gly Pro Ala Leu Thr Tyr
225                 230                 235                 240

Val Ala Gly Phe Thr Gly Phe His Gln Phe Ala Glu Phe Thr Ala Asp
                245                 250                 255

Asp Val Gly Thr Ala Glu Ser Gly Leu Asn Ser Ala Val Leu Ala Ser
            260                 265                 270

Asn Asp Glu Met Val Leu Leu Pro Ile Asn Glu Pro Val His Gly Thr
        275                 280                 285

Lys Arg Lys Ser Gln Ile Gln Thr Tyr Leu Glu His Asn Glu Gly Ala
    290                 295                 300

Gly Leu Gln His Leu Ala Leu Met Ser Glu Asp Ile Phe Arg Thr Leu
305                 310                 315                 320

Arg Glu Met Arg Lys Arg Ser Ser Ile Gly Gly Phe Asp Phe Met Pro
                325                 330                 335

Ser Pro Pro Pro Thr Tyr Tyr Gln Asn Leu Lys Lys Arg Val Gly Asp
            340                 345                 350

Val Leu Ser Asp Asp Gln Ile Lys Glu Cys Glu Glu Leu Gly Ile Leu
        355                 360                 365

Val Asp Arg Asp Asp Gln Gly Thr Leu Leu Gln Ile Phe Thr Lys Pro
    370                 375                 380

Leu Gly Asp Arg Pro Thr Ile Phe Ile Glu Ile Ile Gln Arg Val Gly
385                 390                 395                 400

Cys Met Met Lys Asp Glu Glu Gly Lys Ala Tyr Gln Ser Gly Gly Cys
                405                 410                 415

Gly Gly Phe Gly Lys Gly Asn Phe Ser Glu Leu Phe Lys Ser Ile Glu
            420                 425                 430

Glu Tyr Glu Lys Thr Leu Glu Ala Lys Gln Leu Val Gly *
        435                 440                 445
CLAIMS

I claim: 

1. A biologically pure sample of DNA, the DNA
comprising a DNA sequence coding for the expression of
a plant p-hydroxyphenyl pyruvic acid dioxygenase.

2. A vector containing the DNA sequence of
claim 1.

3. A microbial host transformed by the vector
of claim 2.

4. The DNA of claim 1, wherein the p-hydroxyphenyl
pyruvic acid dioxygenase is from
Arabidopsis thaliana.

5. A transgenic tomato plant transformed with a
DNA construct including the DNA of claim 1.

6. A transgenic Arabidopsis plant transformed
with a DNA construct including the DNA of claim 1.

7. The biologically pure DNA of claim 1 wherein
the DNA is SEQ ID NO:2.

8. A DNA plant gene expression construct
comprising:

a. a DNA sequence coding for the
expression of a plant p-hydroxyphenyl pyruvic acid
dioxygenase;

b. a promoter effective in plant cells
located 5' to the DNA coding sequence; and

c. a 3' termination sequence effective in
plant cells.

9. A DNA construct comprising:

a. a promoter capable of expressing a
downstream coding sequence in a tomato plant;

b. a DNA sequence coding for the
expression of a p-hydroxyphenyl pyruvic acid
dioxygenase of plant origin; and

c. a 3' termination sequence, the
construction capable of expressing a p-hydroxyphenyl
pyruvic acid dioxygenase gene when transformed into
tomato plants.

10. A bacteria containing the construction of
Claim 9.

11. A tomato plant cell containing the
construction of Claim 9.

12. An Arabidopsis plant cell containing the
construction of Claim 9.

13. A transgenic tomato plant comprising in its
genome a foreign genetic construction comprising, 5'
to 3', a promoter effective in tomato, a DNA coding
region encoding p-hydroxyphenyl pyruvic acid
dioxygenase, and a transcriptional terminator, the
genetic construction effective in vivo in tomato
plants to stimulate expression of p-hydroxyphenyl
pyruvic acid dioxygenase.

14. Seed of the tomato plant of claim 13.

15. Fruit of the tomato plant of claim 13.

16. The transgenic tomato plant of claim 11
wherein the DNA coding region encoding p-hydroxyphenyl
pyruvic acid dioxygenase is that set forth in SEQ ID
NO:1.

17. A method of suppressing the production of
vitamin E and plastoquinones in a plant, the method
comprising the steps of:

a. isolating a DNA sequence encoding a p-hydroxyphenyl
pyruvic acid dioxygenase of plant
origin;

b. creating a genetic construction including,
5' to 3', a promoter effective in the plant's cells, a
coding sequence, and a transcriptional terminator, the
coding region being derived from the DNA sequence,
wherein the DNA sequence from step (a) has been
altered so that expression of p-hydroxyphenyl pyruvic
acid dioxygenase is suppressed; and

c. transforming a cell of the plant with the
genetic construction, whereby the plant's cell
produces lowered levels of vitamin E and
plastoquinones.